DO^l'MF^T 



R F S I' M F 



ED 021 473 64 EM 006 151 

By* Nelsoob Carl B. 

THE EFFECTIVEhCSS OF THE USE OF PROGRAMED ANALYSES OF MUSICAL WORKS ON STUDENTS’ 
PERCEPTION OF FORK FINAL REPORT. 

State Umv. of New York, Cortlakod Coll, at Cortland 

Spons Agency* Of fice of Education (DHEW), Washington, DC Bureau of ResearcK 

Bureau No* BR*5* 1084 

Pub Date 16 Dec 67 

Contract- OEC* 1*6*051084*0752 

Note* 233p. 

EDRS Price MF-$1.00 HC-S9.40 

Descriptors* AUDITORY PERCEPTIOH AURAL STIKJLI, COLLEGE INSTRUCTIOH INSTRUCTIONAL MATERlA\LS 
LEARNING THEORIES. LINEAR PROGRAMING. *MUSIC EDUCATION •PROGRAMED INSTRUCTION PROGRAMED 
MATERIALS *TAPE RECORDINGS TRANSFER OF TRAINING 
Identifiers* Beethover\ Haydn, Mozart, Rimsky Korsakov, Smetana^ Tchaikovsky 

Linearly programed materials used with musical references on magnetic tapes 
were compared to conventional listening materials used outside the classroom and to 
assignments for a college music course. The point of the comparison was to ascertain 
whether or not the programed materials significantly improved students’ aural 
perception of the structural components of form relating to specific, musical 
compositions. Over 300 non-music majors in three classes were rated at one of 10 
levels according to their performance on a music achievement test and then assigned 
at random to six treatment groups and a control g'"Oup for each class. The evaluative 
devices used following treatment were tests for five specific skills, a fo5low-up test for 
transfer, and a preference inventory. Results indicated that the programed materials 
increased the students’ perception of musical form m the works studied to a 
significantly greater degree than did the conventional materials. But there was no 
significant indication that the programed materials led to greater transfer of learning 
than did the conventional materials. Preferences for particular compositions were not 
affected by studying the programs. The students indicated on a questionnaire that 
they preferred the programmed materials to the conventional ones. Experimental 
materials are appended. (dB) 



o 



FINAL REPORT 

Project No. 5-1004-6''^^ 
Contract No. OEC-1-6-051084-0752 



THE EFFECTIVENESS OF THE USE OF PROGRAMEO 
ANALYSES OF MUSICAL WORKS ON STUDENTS’ 
PERCEPTION OF FORM 



December 16, 1967 



U.S. DEPARTMENT DF 
HEALTH, EDUCATIDN. AND WELFARE 



Office of Education 
Bureau of Research 



£ 002147 } 



U.S. DEPARTMENT OF HEALTH, EDUCATION & WELFARE 
OFFICE OF EDUCATION 



THIS DOCUMENT HAS BEEN REPRODUCED EXACTLY AS RECEIVED FROM THE 
PERSON OR ORGANIZATION ORIGINATING IT. POINTS OF VIEW OR OPINIONS 
STATED DO NOT NECESSARILY REPRESENT OFFICIAL OFFICE OF EDUCATION 
POSITION OR POLICY. 

Final Report 

Project No. 5-1084 
Contract No. OEC-1-6-051084-0752 



The Effectiveness of the Use of Programed 
Analyses of llusical Works on Students' 
Perception of Form 



Carl B. Nelson 

State University of New York 
College at Cortland 

Cortland, New York 

December 16, 1967 



Tne research reported herein was performed pursuant to a contract 
with the Office of Education, U.S. Department of Health, Education, 
and Welfare. Contractors undertaking such projects under Govern- 
ment sponsorship are encouraged to express freely their profes- 
sional judgment in the conduct of the project. Points of viev; or 
opinions stated do not, therefore, necessarily represent official 
Office of Education position or policy. 




U.S. DEPARTMENT GF 
HEALTH, EDUCATION, AND VIELFARE 

Office of Education 
Bureau of Research 







TABLE OF CONTEIJTS 

Page 

List of Figures iv 

List of Tables ^ 

List of Experimental Materials 

Acknowledgments 

Summary ^ 

CHAPTER I INTRODUCTION ^ 

Statement of the Problem ^ 

Review of the Literature 8 

CHAPTER II 14ETH0DS 11 

Selection and Description of the Sample 11 

Experimental Treatment 1-3 

The Experimental Materials 1^ 

Procedure 1^ 

Time Schedule 20 

The Criteria 21 

The Achievement Tests for the 

Assigned Works 21 

The Follow-up Achievement Tests 25 

The Elementary Achievement Test 26 

The Preference Inventory 27 

The Questionnaire 29 

Controls 30 

Summary 31 

CHAPTER III THE Ai^ALYSIS AND FINDINGS 32 

Description of Classes : EHAT Scores 32 

Analysis of Achievement Test Scores 33 

The Test of Hypothesis la ... 35 

The Test of Hypothesis lb ^2 

The Test of Hypothesis Ic 

The Test of Hypothesis Id 

Comparison with General Control Groups ... 50 

Time Scores 30 

The Analysis of Preferences 53 

The Results of the Questionnaire 57 

'jummary ^3 



- ii - 



Page 

CHAPTER IV SUliliARY AND RECOIUiEilDATIOiiS 66 

Review of Rationale 66 

Features of the Experimental Materials 67 

Recommendations for Further Research 69 

Use of Programs 69 

Measurement of Aural Skills 70 

Transfer of Aural Perceptions 71 

Perception Differences 71 

Preferences 72 

Summary of Findings 72 

References 75 

Appendix A 76 

Appendix B 1^6 



- iii 



LIST OF FIGURES 



Figure 

1. Schematic of Experimental Design 

2, Comparisons Ilade to Test Hypotheses la, lb, 
Ic , and Id 



Page 

17 

34 



iv ~ 



LIST OF TABLES 



Table Page 

1. DISTRIBUTION OF ALL STUDEOTS 

EMOLLED IN IflJ 112 SPRING SEMESTER, 12 

1967, ACCORDING TO MAJOR FIELD OF 
STUDY AND CLASSIFIGITION 

2. DESCRIPTION OF ALL CLASSES BASED 

ON EMAT DATA 32 

3. NUMBER OF SIGNIFICAilT DIFFERENCES 

FOUND BY THE TESTS OF HYPOTHESIS la 36 

BEWEEN THE RELEVAilT CONTROL AND 
EXPERIMENTAL SUBGROUPS GROUPED BY 
COMPARISON (ALL CLASSES) 

4. NUl'IBER OF SIGNIFICANT DIFFERENCES 
FOUND BY THE TESTS OF HYPOTHESIS la 

BETWEEN THE RELEVANT CONTROL AlH) 38 

EXPERIMENTAL SUBGROUPS GROUPED BY 
CLASS (ALL CLASSES) 

5. NUl'IBER OF SIGNIFICANT DIFFEIUSNCES 

FOUND BY THE TESTS OF HYPOTHESIS la 38 

BETWEEN THE RELEVANT CONTROL AtH) 

E:CPERIMENTAL subgroups grouped BY 
COMPOSITION STUDIED (ALL CLASSES) 



6. SPECIFIC SUBTEST-COMPARISONS YIELDING 

SIGNIFICANT DIFFERENCES TO SUPPORT 44 

HYPOTHESIS lb 

7. SPECIFIC SUBTEST-COMPARISONS YIELDING I 

SIGNIFICAI^T DIFFERENCES TO SUPPORT 45 

HYPOTHESIS Ic 

8. SPECIFIC SUBTEST-COMPARISONS YIELDING 

SIGNIFICANT DIFFERENCES TO SUPPORT 48 

HYPOTHESIS Id 

9. GRAND MEANS OF ALL OBSERVATIONS FOR ’ 

TESTS OF EACH HYPOTHESIS COLLAPSING 49 

THE THREE CLASSES 

10. MEAN TIME SPENT BY ALL STUDENTS IN 

PREPAR/iTION FOR ACHIEVEMENT TESTS 52 



- V - 









1 




Table 



Page 



!. '! 





11. PREFERENCE RAi'TKINGS OF PROGRA. a.ATIC 
AIR) ABSOLUTE COIIPOSITIOIIS 0? STUDENTS 
GROUPED BY CLASS 

12. iJEAN SCORES OF EXPERI’.IENTAL AND CONTROL 
GROUPS EXPRESSING PPJEFERENCE FOR 
PROGRAi’illATIC VERSUS ABSOLUTE 
COilPOSITIONS 



54 



55 





13. MEAN SCORES SHOWING DIRECTION OF 

PREFERENCES FOR ABSOLUTE VERSUS 55 

PROGRAIlilATIC COMPOSITIONS OF EXPERriENTAL 
STUDENTS GROUPED BY NUl^RiER OF PROGRAMS 
STUDIED 









14. MEAI^ SCORES SHOWING DIRECTION OF PREFERENCES 

OF EXPERIMENTAL STUDENTS FOR ABSOLUTE 56 

COilPOSITIONS STUDIED WITH EXPERI. lEMTAL 
ilATERIALS VERSUS THOSE STUDIED 
CONVENTIONAL MATERIALS 

(APPENDIX A) 



15. TEST OF HYPOTHESIS la; COMPARISONS OF 77 

HARMONY AND TONALITY SUBTEST liSAl'IS 

IG. TEST OF HYPO'THESIS lb; COMPARISONS OF 73 

HARMONY AND TONALITY SUBTEST MEAI-IS 

17. TEST OF HYPOTHESIS Ic; COMPARISONS OF 79 

HARMONY Ai^D TONALITY SUBTEST MEMS 

18. TEST OF HYPOTHESIS Id; COMPARISONS OF 30 

HARMOOT Ai-ID TONALITY SUETEST MEANS 

19. TEST OF HYPOTHESIS la; COMPARISONS OF 81 

KNOWLEDGE OF FORM SUBTEST MEAI'IS 

20. TEST OF HYPOTHESIS lb; COMPARISONS OF 82 

KNOTJLEDGE OF FOPxlI SUBTEST MEANS 

21. TEST OF HYPOTHESIS Ic; COMPARISONS OF 33 

ia;«CT‘JLEDGE OF FORM SUBTEST MEAl^S 

22. TEST OF HYPOTHESIS Id; COMPARISONS OF 84 

KNOWLEDGE OF FORM SUBTEST MEANS 





- vi - 










Table 



Page 



23. 


TEST OF HYPOTHESIS lai 
IIELODY SUBTEST MEANS 


COMPARISONS 


OF 


85 


24. 


TEST OF HYPOTHESIS lb: 
MELODY SUBTEST MEAi'JS 


CO^IPARISONS 


OF 


86 


25. 


TEST OF HYPOTHESIS Ic: 
MELODY SUBTEST MEANS 


COMPARISONS 


OF 


37 


26. 


TEST OF HYPOTHESIS Id: 
14EL0DY SUB1T.ST MEANS 


COMPARISONS 


OF 


88 


27. 


TEST OF HYPOTHESIS la: 
RHYTHM SUBTEST IffiAi^S 


COMPARISONS 


OF 


89 


28. 


TEST OF HYPOTHESIS lb: 
EHYTHII SUBTEST MEANS 


COMPARISONS 


OF 


90 


29. 


TEST OF HYPOTHESIS Ic: 
RHYTIEl SUBTEST I4EA1^IS 


COI'IPARISONS 


OF 


91 


30. 


TEST OF HYPOTHESIS Id; 
RHYTffi'I SUBTEST MEAi^S 


COMPARISONS 


OF 


92 


31. 


TEST OF HYPOTHESIS la: 
TIMBRE SUBTEST MEANS 


COUIP ARISONS 


OF 


93 


32. 


TEST OF HYPOTHESIS lb; 
TII4BIUS SUBTEST MEANS 


COI^PARISONS 


OF 


94 


33. 


TEST OF HYPOTHESIS Ic: 
TIMBRE SUBTEST MEANS 


COMPARISONS 


OF 


95 


34. 


TEST OF HYPOTHESIS Id; 
TIMBRE SUBTEST MEAI4S 


COMPARISOkJS 


OF 


96 


35. 


TEST OF HYPOTHESIS la: 
AUDIO PORTION ONLY 


COMPARISONS 


BY 


97 


36. 


TEST OF HYPOTHESIS lb: 
AUDIO PORTION ONLY 


COMPARISONS 


BY 


98 


37. 


TEST OF HYPOTHESIS Ic: 
AUDIO PORTION ONLY 


COilPARISOi'IS 


BY 


99 


38. 


TEST OF HYPOTHESIS Id; 
AUDIO PORTION OIJLY 


COi^IPARISONS 


BY 


100 



- vii - 



Table 



Page 



39. TEST OF HYPOTHESIS lao COIIPARISOHS OF 101 

TOTAL TEST SCORES 

40. TEST OF HYPOTHESIS lb; COlIPMISOiMS OF 102 

TOTAL TEST SCORES 

41. TEST OF HYPOTHESIS Ic; COiIP/jasOilS OF 103 

TOTAL TEST SCORES 

42. TEST OF HYPOTHESIS Id; COllPARISOHS OF 104 

TOTAL TEST SCORES 

43. PPJSFEREHCE RAKKIHG OF PROGRAIEIATIC AHD ABSOLUTE 105 

COIIPOSITIONS BY CLASS I GROUPED BY TREATHEMT 

44. PREFERENCE RAl^CIHG OF PROGRAiE'iATIC AND ABSOLUTE 106 

COIIPOSITIONS BY CLASS II GROUPED BY TREATIIEilT 

45. PREFERENCE RANICING OF PROGRAIE'iATIC AND ABSOLUTE 107 

COIIPOSITIONS BY CLASS III GROUPED BY TREATMENT 

46. PREFEPJENCE RAInIKING OF PilOGRAl-fllATIC AND ABSOLUTE 108 

COMPOS mONS BY CLASS I GROUPED BY BLOCK 

47. PREFERENCE RANKING OF PROGRAIHIATIC AND ABSOLU'. ’=■. 109 

COMPOSITIONS BY CLASS II GROUPED BY BLOCK 

48. PREFERENCE RANKING OF PROGPJUfllATIC Al-ID ABSOLUTE 110 

COIIPOSITIONS BY CLASS III GROUPED BY BLOQC 

49. ilEAN SCORES BASED ON STUDENTS' PREFERENCES FOR 111 

COIIPOSITIONS STUDIED BY PROGRAMED ANALYSES VS. 
CONVENTIONAL MEANS GROUPED BY TREATilENT 

50. I4EA1I SCORES BASED ON STUDENTS' PREFERENCES FOR 112 

COrlPOSITIONS STUDIED BY PROGRAMED AI'IALYSES VS. 
CONVENTIONAL MEANS GROUPED BY BLOCK 

51. FREQUIilNCIES OF REAXTIONS TO QUESTION REGARDING 113 

EASE OF MANIPULATION OF TAPE ADJUNCTS 

52. FREQUENCIES OF REACTIONS TO QUESTION REGARDING 114 

USEFULNESS OF THE PROGRiV'IS AS A TEACHING TOOL 

IN LEARNING TO LISTEN TO ASSIGNED COMPOSITIONS 

53. FREQUENCIES OF REACTIONS TO QUESTION REG/iilDING 115 

USEFULNESS OF THE PROGRAMS AS k TEAXHING TOOL 

IN LEARNING DESCRIPTIVE WORDS AND PHRASES 
RELATING TO MUSICAL MEANINGS 

- ♦ 



• « * 

vxii - 



Tabl‘d 



Page 

54. FREQUENCIES OF REACTIONS TO QUESTION REGAPvDING 116 

THE COIRENSURATE VALUE OF LEARillNG OUTCOMES 

COMPARED TO TIxiE SPENT STUDYING THE PROGPvAMS 

55. FREQUENCIES OF REACTIONS TO QUESTION REGARDING 117 

USEFULNESS OF LINE SCORE NOTATION IN TEXT 

OF PROGPAIIS 

56. FREQUENCIES OF REACTIONS TO QUESTION REGARDING 118 

VALUE OF THE PROGRAMS IN TEACHING STUDENTS TO 

FOLLOU THE ORCHESTRA SCORE 

57. FIJSQUENCIES OF REACTIONS TO QUESTION REGAilDING 119 

EFFECTIVENESS OF PIANO EXCERPTS ON THE TAPE IN 
POINTING OUT SPECIFIC PATTERNS IN THE 

COMPOSITIONS 

58. FREQUENCIES OF REACTIONS TO QUESTION REGARDING 120 

FLEXIBILITY OF THE PROGRAMS IN MEETING 

STUDENTS’ NEEDS 

59. FPvEQUENCIES OF PvEACTIONS TO RECOIEIENDATIONS 121 

THAT PROGRAIIED AI>}ALYSES BE A PART OF LEARNING 
E2(PERIENCES IN MUSIC 112 

60. FPxEQUiSNCIES OF REACTIONS TO QUESTION REGARDING 122 

THE SIZE OF STEPS IN THE TEICT OF THE 

PROGRAIIED Al^ALYSES 

61. FREQUENCIES OF REACTIONS TO QUESTION REGARDING 123 

AMOUNT OF TR/NSFER FROM PROGRAIIED AI'TALYSES TO 

OTHER LEARNING SITUATIONS IN MUSIC 112 

62. FREQUENCIES OF REACTIONS TO QUESTION REGARDING 124 

FEATURES OF THE TEXTS OF THE PROGRAMED Al^ALYSES 

63. FREQUENCIES OF REACTIONS TO QUESTION POSGARDING 125 

FEATURES OF THE TAPES OF THE PROGRAMED Ai'lALYSES 



- ix - 




LIST OF iSXPERIilENTAL MATERIALS 




II 

n 

0 

1 

n 

a 



D 

Q 

0 

0 

0 



(Appendix B) 



Item Page 

1. I'lusic 112 Calendar for Experiment 127 

2. Instructor's Announcement Concerning 128 

Experimental Procedure 

3. Sample Listening Assignment Sheet 131 

4. Assignment Sheet for Programmatic Compositions 133 

5. Listening Test Based on Fourth Movement of 134 

Haydn's Symphony 101 (Rondo) 

6. Listening Test Based on Second iiovement of 148 

Haydn's Symphony 94 (Variations) 

7. Listening Test Based on First Movement of 165 

Mozart's Symphony 40 (Sonata) 

8. Follow-up Test: Form N (Rondo) IbO 

9. Follow-up Test: Form P (Variations) 192 

10. Follow-up Test; Form 0 (Sonata 205 

11. Preference Form 218 

12. Questionnaire Letter of Transmittal 219 

13. Questionnaire 220 




- x - 



ACKN0V7LEDGMENTS 



Special acknowledgments are extended to the following 
staff members of the Department of Music, State University 
of New York College at Cortland: Professors Carl Evans, 

Samuel Forcucci, Ellen Llgon, Vincent Picerno, and Walter 
Ross. Their willing cooperation in the prosecution of this 
research was most noteworthy and in the very best spirit of 
professional and academic tradition. Also without the active 
support of several administrative offices of Cortland College 
in establishing certain conditions and facilities.^ the study 
could not have proceeded. 

The author is Indebted to the consultants who agreed to 
share with him their expertise and experience in carrying 
through the various stages of the investigation. The music 
consultant was Dr. Earl George, Associate Professor of Music, 
Syracuse University. The consultant for statistics, research 
design, and measurement was Dr. Jason Millman, Associate Pro- 
fessor of Educational Psychology, Cornell University. 

The Research Foundation of the State University of New 
York played an important supportive role in providing expert 
counsel in the preparation of official papers and documents 
as well as in various liaison relationships between the in- 
vestigator and the funding agency. The author is grateful 
to the offices of this Important unit of the state univer- 
sity for their cooperation and facilitating services. 



CBN 



- xi - 



SUMMARY 



The purpose of this Investigation was to determine whether 
or not individuals’ aural perceptions for the structural components 
of form relating to specific musical compositions can be more highly 
developed using programed analyses of the work than by studying the 
works with conventional materials and aids. The following hypotheses 
were formulated to be tested: 

1, All things being equal, a student's perceptive skills 
t elating to the components of structure In a musical 
work will be greater when 

a. the student studied the composition under examination 
using an adjunct linearly constructed program vjith 
Inserted magnetic tape musical references. 

b. the student studied another composition than that 
under examination but having the same musical design 
(l.e., rondo, variations, etc.) using an adjunct 
linearly constructed program vdth Inserted magnetic 
tape musical references. 

c. the student studied a greater number of compositions 
using adjunct linearly constructed programs with in- 
serted tape musical references. 

d. the student studied earlier In the term a composition 
using an adjunct linearly constructed prograra with 
Inserted magnetic tape musical references. 

2. Students who have completed the programed analyses will 
demonstrate a greater preference for absolute music than 
students not exposed to the adjunct programs, and the more 
adjunct programs completed, the more likely that this dif- 
ference will be observed. Preference will be greatest for 
compositions of the musical form studied using the adjunct 
materials . 

Three movements of symphonies from the Classic period were selected 
to be programed: 1. the first movement of Mozart's 40th Symphony in 

g minor (sonata-allegro), 2, the second movement of Haydn's 94th Symphony 
in G Major (variations), 3. the fourth movement of Haydn's lOlst Sym- 
phony In D. Major (rondo). Each of the programed tests was structured 
to be used with an adjunct tape of musical references In order to remove 
the restriction of the usual requisite ability to read the musical score. 



However, the score was provided in the event that students could take 
advantage of it. 

The sample selected for the experiment was 339 non-music major 
college undergraduates enrolled in a one semester Introductory music 
literature course entitled Foundations of Musical Expression. The 
students were grouped in three sections, each with a different in- 
structor. The experimental materials were used for outside-of-class 
assignments only. 

Perceptive skills relating to the three programed works were 
measured by three achievement tests. Transfer of learning was eval- 
uated by the administration of achievement tests of similar construc- 
tion of three unfamiliar works in the same form as those programed. 

The criterion developed to determine changes in the subjects* pref- 
erences brought about by the experimental materials was an inventory 
consisting of all possible paired comparisons of the three programed 
works and three selections classified as "program music". Finally, 
a questionnaire was used to assess attitudes of students who used 
the programs to the value of the experimental materials. 

The use of a balanced multiple comparisons design made It possible 
to test the hypotheses concerned with the perceptive skills in several 
ways. Since the total sample of subjects was grouped into three sep- 
arate sections, the experimental conditions were replicated by class 
and Instructor. 

Each achievement test was constructed so that In addition to the 
total test, it was possible to determine perceptions of Individuals 
In five sub tests: harmony and tonality, knowledge of form, melody, 
rhythm, and timbre. Some of the subtests were found to have unusually 
low reliability Indices; the low proportion of significant differences 
between experimental and control subgroups in some of the subskills 
probably was due in large degree to this circumstance. Tests of the 
hypotheses listed above provided evidence for the following conclusions 

Hypothesis Conclusion 

la Individual's aural skills relating to a given 

work are developed significantly more when the 
work is studied by the use of a programed analysis 
and tape adjunct than the aural skills of persons 
who study the same work by means of textbooks, 
conventional library references, and recordings. 



Hypothesis 

lb 



Conclusion 



The form of a composition does not appear to be 
a significant factor in facilitating the transfer 
of aural skills from one composition to another 
even though a programed analysis is the means of 
instruction in the earlier composi^.ion. 

Ic A direct relationship between numbers of programed 

analyses studied and a significantly keener aural 
perception of variations of modifications of musical 
sounds in works not previously audited cannot be 
clearly established. 

Id Tho transfer of an Individual's aural skills de- 

veloped earlier to an unfamiliar work is not signi- 
ficantly modified by the factor of relative time 
placement in a term of study that a programed an- 
alysis is introduced. 

2 An increase in the amount of knowledge an Individual 

has concerning a particular composition does not seem 
to be a factor in modifying his preference for that 
composition as listening fare compared to others. Hor 
are preference patterns for music categorized as pro- 
gram music or absolute music determined by an indivi- 
dual's aural skills in general or as these relate to 
specific compositions. Studying a composition by the 
use of a programed analysis is not a factor in modi- 
fying students' preferences for that composition when 
compared to compositions studied by conventional means. 

Also, data from the questionnaire survey revealed that the majority 
of the students appreciated the guidance of the programed analyses. They 
felt that the programs were powerful instructional tools and that they 
learned a great deal from them. Survey data and observations made during 
the experimental procedure revealed that the use of programed analyses 
in a course of instruction in music literature presents no unique prob- 
lems with reference to practical considerations such as staffing, space 
or students' manipulation of the tape equipment. 




CHAPTER I 



INTRODUCTION 



Statement of the Problem 



It is generally recognized that individuals who have a 
nominal background of music training have considerable diffi- 
culty in perceiving the structure of large musical forms. The 
subtleties of combinations and arrangements of the components 
which comprise the work are elusive to them and often escape 
their attention. Even though they may have been exposed to for- 
mal courses in which the aim is to teach them the basic skills 
and foundations of theory, the gap between these knowledges and 
audio recognition of the musical components within the context of 
large art forms is considerable. In short, teaching individuals 
to learn to listen intelligently to a complex musical structure 
Is not an easy task. 

Many music educators share the opinion that only those in- 
dividuals who have a reasonably good foundation in music have the 
necessary skills to profit from instruction in serious music lit- 
erature. Furthermore, some educators believe that performing ex- 
perience is a necessary prerequisite for those who expect to develop 
a satisfactory degree of sensitivity and sophistication to musical 
expressions because as performers it has been absolutely necessary 
for them to become actively involved in the structure of musical 
forms and have consequently discovered at first hand how a given 
work is shaped. 

Students who have a limited background in aural skills and 
music theory often say that they do enjoy listening to music but 
are unaware of the source of their pleasure. They seem to recog- 
nize that their enjoyment derives from the total "mass" of the 
sound rather than through an appreciation for the development of 
the architectural structure. The main impact of the music for 
them apparently is a generalized emotional response to undiffer- 
entiated sounds. 

Much listening to great i-jorks of musical art \\rith the guidance 
of a competent instructor will do much to help the student sharpen 
his perceptive powers. With large and ever-growing classes and/or 
numbers of classes in introductory music literature courses, how- 
ever, ways and means must be developed to maintain efficiency of 



instruction without a corresponding Increase in cost. Large 
classes are now common in many schools T»hich offer students an 
exposure to great works musical literature. In many institu- 
tions, courses developed xor this purpose are required, at least 
for arts and sciences majors. It becomes imperative that stu- 
dents are expected to complete outside-of-class listening assign- 
ments as partial fulfillments of course requirements. Two primary 
obstacles face students in preparing these assignments : a) their 

limited aural acuity, and b) their unfamiliarity with the orches- 
tral score. 

Analyses of assigned works by means of linear programing* 
using specially-prepared magnetic tape adjuncts to a written test 
for each composition seem to hold promise. This procedure should 
prove to be practical and effective since such an analysis a) can 
be built to reach the student with a limited background of knowledge 
in musical skills, b) would allow the student to proceed at his ox^n 
rate of speed and c) would Include a tape which can provide direct 
allusions to specific sounds of the work rather than requiring the 
student to interpret music notation symbols. Multiple copies of 
the tape and text would enable each student to modify the use of 
the teaching materials to suit his own needs, so that numbers of 
students to be accommodated need only be limited by the number of 
tape transports and headphones available. 

The construction of programs to Instruct individuals in in- 
creasing their aual acuity relative to musical sounds is not new. 

As will be shown later, other researchers have carried on inves- 
tigations which were limited to teaching single entities (dimen- 
sions) of sound such as melodic structure, recognition of intervals 
and the like. The skill involved in perceiving the patterning and 
structuring in a major musical work as it takes shape seems to be 
a behavior of a higher order than the perception of short and direct 
"uni -dimensional*’ musical stimuli. Hence it is pertinent to ask the 
question a) can analyses of certain musical works be programed for 
individuals whose basic training in music is informal, and b) will 
such analytical programs be sufficiently effective teaching tools 
for increasing their aural perception (relating to the design of 
the work) to merit their use? 



*The term "programing** here refers to the technique of arranging 
material to be learned in such a way that a student can reach de- 
sired objectives by proceeding through a series of carefully construct- 

ed small steps. The student must actively respond to questions or 
must complete statements at each step. He is asked to check his re- 
sponses with those supplied by the author of the program; if the de- 
vice is properly constructed, he will respond correctly a high percen- 
tage of the time and the desired behavior is thus reinforced. 



It would be of considerable interest also to discover whether 
or not programed analyses of certain musical works would modify the 
preferences of individuals in favor of the works so analyzed when 
compared to others. It has often been observed, for example, that 
persons with a nominal background of musical training will prefer 
to listen to programmatic music* since they are able to make refer- 
ences and associations to known objects or ideas when the musical 
stimulus is presented. However, when a composition with no pro- 
grammmatic reference is performed, the untrained ear has little to 
guide it - unless, of course, the individual responds in a simple 
emotional manner to the total effect of the sounds; hence the per- 
son may tend not to "like" it as well as a programmatic composition. 

A measure of preference change would provide evidence to test 
the validity of the assumption that increased knowledge about a 
work will modify a person’s preference for the composition. Tlie 
relevance of this finding is apparent since preference for parti- 
cular kinds of music obviously is an important factor in an indi- 
vidual’s choice of listening fare. Thus, for example, such prefer- 
ences could affect which concerts he attends, which recordings he 
purchases, or in the case of a teacher, which compositions are se- 
lected for a listening lesson. 

In summary, many persons v/lth a limited background of skills 
in music who wish to acquire some basic knowledge of music liter- 
ature beyond a superficial acquaintance are not sufficiently skilled 
to perceive the structure of a relatively complex art form. It is 
important that individuals learn to make essential aural discrimina- 
tions within the context of these works. This is exceedingly dif- 
ficult for the minimally-prepared student who finds himself at a 
loss to know how to listen when a) his ear is not trained to make 
certain discriminations and b) his ability to follow the orchestral 
score as a guide is limited. The problem posed in this investigation 
is to discover if it is possible and feasible to utilize a linear 
programming technique for the anlysis of a given work when magnetic 
tape adjuncts are used as guides rather than music notation and the 
conventional orchestra score. 

This investigation was planned to discover if analyses of com- 
positions could be programed, whether such techniques are practical 
and whether or not the programs are efficient instructional tools 
for the student. The experimental design is constructed so that 



*Programmatic (or "program") music is instrumental music which 
is expressly written to describe moods or topics suggested by the 
title or the program accompanying the music. 



tests can also be made of the relative value of programs based 
on compositions written in certain forms,* their order and their 
placement in the terra as well as the effect of the number of pro- 
grams read. Therefore, the following hypotheses were set up to 
be tested: 

1. All things being equal, a student's perceptive skills 
relating to the components of structure in a musical 
work will be greater when 

a. the student studied the composition under examin- 
ation using an adjunct linearly constructed program 
with Inserted magnetic tape musical references. 

b. the student studied another composition than that 
under examination but having the same musical de- 
sign (l.e., rondo, theme and variations, etc.) 
using an adjunct linearly constructed program with 
Inserted magnetic tape musical references. 

c. the student studied a greater number of compositions 
using adjunct linearly constructed programs with in- 
serted magnetic tape musical references. 

d. the student studied earlier in the term a composi- 
tion using an adjunct linearly constructed program 
with Inserted magnetic tape musical references. 

2. Students who have completed the programed analyses will dem- 
onstrate a greater preference for absolute music** than stu- 
dents not exposed to the adjunct programs, and the more ad- 
junct programs completed, the more likely that this difference 
in preferences will be observed. Preference will be greater 
for compositions of the musical form studied using the adjunct 
materials. 



*In this study, rondo, sonata-allegro, and theme and variations 
were examined. 

**Absolute music describes music which is intended to have as its 
principal focal point the form or the design Itself for the listener's 
enjoyment. 



Review of the Literature 



The following section of this chapter will be a review and 
an evaluation of studies in music education which were designed 
to assess the contribution of linear programing techniques in 
increasing individuals* aural perceptions for specific musical 
sounds. Some Investigations which have indirect relevance to 
this study will also be discussed. Although programed instruc- 
tion in music education is not new, Varty (1966)^ points out in 
his on-going survey of the use of programs that the general area 
of fine arts, compared to the field of natural sciences, has made 
relatively little use of them in formal instruction. No reports 
are available which appraise the effectiveness of programed in- 
struction as a means of improving the aural perception of indivi- 
duals relating to the design of works of particular musical forms. 
Rather, research in programed techniques consistently has referred 
to improvement in basic aural skills taken singly and outside spe- 
cific art structures. 

The technique of using a printed text accompanied by a tape 
in programing musical sound has been shown to be both practical 
and feasible. Kanable (1964)^ recently completed a doctoral experi- 
ment involving a programed approach to sight f;inging using students 
enrolled in the Summer High School Music Project at Northwestern 
University. This was an effort to discover if the programed text- 
and-tape technique could be used to give students an adequate back- 
ground in music before they entered college course work. The pro- 
gramed text-and-tape method reportedly seemed to be a practical 
plan of teaching for the development of basic music skills used in 
sight singing. In such a course, the emphasis is placed upon re- 
cognition and performance of rhythmic and melodic patterns. 

Carlsen (1964)^ supported Kanable *s conclusions in a paper 
delivered at the March, 1964, Music Educators National Conference 
in Philadelphia. He also reported that Maltzmann at Harvard Univ- 
sity is seeking to discover whether through programing devices, 
certain identifiable discriminations by individuals can be ob- 
served in the development of young children as they try to develop 
perceptive skills in music, Nhlle his work has little relevance 
to the design of the present study, it is cited to lend credence 
to the proposal that aural perception in music can be developed 
through programed instruction. 

Two recent studies have been published to support the contention 



^Footnote references are listed at the end of Chapter IV. 
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that Individuals can teach themselves to develop fine aural dis- 
criminations through a programed course of Instruction. The first 
of these was reported by Spohn (1963)^. He and his coworkers pro- 
gramed melodic and harmonic Interval material for students entering 
the Ohio State University School of Music. They discovered that 
students using the program did significantly better than the com- 
parison group. Unfortunately, Spohn *s conclusions can only be 
accepted as tentative since his experimental subjects were those 
who completed the programed course of study while his comparison 
group (controls) were students who, for unknown (or at least un- 
reported) reasons, chose not to complete the program. 

In the same paper, Spohn stated that different methods of pro- 
graming appear to result in different outcomes and his findings In- 
dicate that for some elements of music, certain combinations of stim- 
uli and modes of presentation vary in their effectiveness. He at- 
tributes this to differences among learners themselves. The impor- 
tance of this statement to the present investigation is that pro- 
graming material for music majors possibly is a different procedure 
than for non-majors. However, Spohn presents no statistical data 
to support his point of view. 

The second reference is provided by Ihrke’s work (1958)^. He 
presented the pedagogical and psychological bases for his research 
in the development of an automated device for the perception of 
rhythm. Ihrke is working to achieve three objectives in programed 
instruction: a) develop equipment in which the response is rhythmic 
performance, b) develop equipment which provides immediate and in- 
stantaneous feedback to the student, and c) develop an effective 
training program. Here again, the central aim is to help the in- 
dividual achieve aural sensitivity to a single aspect of the musical 
texture. 

The contributions of two educators who have constructed tech- 
niques to develop students' awareness for the total musical expression 
in the context of an art work are relevant to this discussion. They 
are important because the teaching procedures each person developed 
are systematic and planned experiences to aid the students to grasp 
the music's sense through a ’’guided tour" of the shape and design 
as the work is being presented. 

Kearns (1964)^ claimed a successful attempt at teaching non- 
music majors to listen for specific musical structures using illus- 
trations on magnetic tape. His method consisted of recording the 
composition and then dubbing the instructor's voice on the tape 
directly over the music providing spoken numbered reference points 
for the students. The subjects were given drill sheets on which 
they were to match the lettered responses on the sheets— which 



represented Ideas about the music— with the numbers called out 
on the tape, Kearns* procedure differs significantly from the 
present research project in three ways; a) the subjects listened 
in groups rather than individually, b) the material presented was 
not programed in small graduated steps, c) no attempt was made to 
call attention to the craftsmanship or artistry of the composer in 
his expressions 5 instead the instruction consisted of a simple 
numbering of the parts which then the students were asked to iden- 
tify. 



Mueller (1964)? utilized a visual medium of communication 
rather than sound to help students perceive the structure of the 
art form. She writes that she used a device to teach non-music 
majors about form by means of charts. These charts showed the 
motifs, developments, and to some extent, the harmonization of 
a number of musical compositions in color. The charts were txi^elve 
Inches high and sometimes as much as twenty-five feet long. Thus 
the student could look at a chart of the form as the music was pre- 
sented, This method, unfortunately, requires that the student have 
sufficient skill to read musical notation in order to follow the 
device; for him to pick out the music represented on the chart by 
notation from the complete tapestry of sound he hears calls for an 
ability he often does not possess. 

The studies discussed above clearly demonstrate that a method 
of reaching the non-music major is needed employing a system he is 
able to utilize. The majority of students are nonetheless basi- 
cally capable of recognizing most variations in the sounds them- 
selves. Certainly if they are not capable of the latter, it will 
do little good to attempt to develop their comprehension through 
the notation. The investigators have shown that it is practical 
and feasible to use magnetic tape adjuncts with programed texts to 
develop aural skills prerequisite to the perception of musical de- 
sign. It has also been shown in the preceding evaluation that the 
efforts made to date have concerned themselves with the Improvement 
of perception in specific single dimensions of musical sounds, or 
more specifically, outside the context of the art form. 



10 - 



CHAPTER II 



METHODS 



The purpose of this chapter will be to state in detail the basic 
features of the experimental design and the procedure used for its 
implementation. Emphasis is given to a description of the sample of 
individuals selected for the experiment, assignment of subjects to 
the various treatment subgroups and the experimental treatment. An 
outline of the relevant aspects of the instructional materials used 
both for the experimental and control groups is presented. The cri- 
teria used to compare the subgroups as determined by the experimental 
design as well as the pretest which was used as a blocking variable 
are Introduced in this section of the report. Finally, the system 
of controls which were built into the design to suppress undesirable 
sources of variation is explained and discussed. 



Selection and Description of the Sample 



The central objective in this investigation is to note the 
effect of carefully prepared programed analyses of certain large 
musical forms as teaching devices for the students who have had 
neither the benefit of special music preparation nor Intend to 
continue serious study of music. At the college level, such in- 
dividuals enrolled in music courses are indentlfled as non-music 
majors. It is to be expected, of course, that a certain percent- 
age of non-majors enter college with a fairly good background of 
experiences in music. However, the majority of them have consid- 
erable difficulty in reading the musical score and an even larger 
proportion of these individuals have untrained ears; that is to 
say, their aural perception is generalized rather than attuned 
to particular sounds and sound patterns. For such students who 
expect to register for courses to learn about music of Western 
civilization, a semester of introductory general theory and ear 
training is often required. However, it has been noted that there 
is a formidable gap between successful achievement in beginning 
theory and ear training and the ability to apply these to the new 
situation of aurally perceiving the structural components of major 
musical expressions (which form the basic content of such music 
literature courses). 

It is to this large population of college students that the 
investigation is addressed, although certainly the results could 
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be generalized to any like group of individuals. The author selected 
as his sample of subjects non-music majors at State University College » 
Cortland, New York, registered in the course Foundations of Musical 
Expression II during the second semester of the 1966-67 academic year. 
The broad aim of the course is to develop the ability to listen intel- 
ligently to music; the subject matter content is limited to the great 
music of Western civilization mainly from the 17th century to modem 
times and taught within an historical context. The class meets three 
hours per week for a total of three semester hours of credit. The pre- 
requisite is Foundations of Musical Expression I, which is a course in 
elementary theory and ear training. 

For the purposes of the experiment, students were permitted to 
select any one of three ^ different sections (each was instructed by a 
different professor). The total number of students enrolled in Mu 112 
was 339; Class registrations were as follows: Class I, 124; Class II 

109; Class III, 106. Table I shows the distribution of students in each 
of the classes according to major field of concentration and classifi- 
cation. 



Table 1 

DISTRUBUTION OF ALL STUDENTS ENROLLED IN MU 112 
SPRING SEMESTER, 1967, ACCORDING TO MAJOR 
FIELD OF STUDY AND CLASSIFICATION 







Classification 








Major 


Freshman 


Sophomore 


Junior 


Senior 


Totals 


Elementary 


Education 


20 


150 


5 


14 


189 


Secondary 


Education 


17 


57 


7 


11 


92 


Physical 


Health & 


10 


0 


0 


0 


10 


Recreation 

Education 

Liberal Arts 


24 


15 


5 


2 


46 


Totals 


71 


242 


17 


27 


337» 



^Although 339 students registered for the course, two students 
dropped out of class at mid-term because of illness. 



- 12 - 



Each class was subdivided by the Instructor Into four small 
"laboratory" or "recitation" sections so that he could choose to 
meet In a single large lecture session or with groups of students 
for discussion as the need arose. Hoxjever, the registration 
schedule Indicated that students would be required only to meet a 
total of three hours per week In some combination of lecture, and/or 
discussion. The Instructors were free to modify the class schedule 
as they wished; In fact no restrictions were placed on the pro- 
fessors save two; a) sufficient class time was to be released to 
the Investigator for testing, and b) no Instruction x^as to be given 
using the compositions chosen for the experiment. It was hoped that 
keeping the amount of Intrusion due to experimental manipulations at 
a minimum would preserve as normal a setting as possible. 

One Instructor (Class III) chose to meet his class four periods 
per week; the fourth period was a required "listening" session. 

The Instructor felt this was a necessary adjustment since the 
testing periods due to the experimental design used up nearly one- 
fourth of the term. The other two Instructors did not modify In 
this manner. 

At the beginning of the term. In each of the three sections, 
the students were pretested on Colwell's EIIAT*. On the basis of 
the distribution of scores from this testing, students on each 
section were blocked Into ten ability classifications from high to 
low. Within each ability block or classification, students were 
randomly assigned Into one of seven treatment groups (see Figure 1 
below). The purpose for the blocking, of course, was to make the 
comparisons among the subgroups more powerful. 



Experimental Treatment 



The experimental treatments consisted of studying either one 
or two of the three programed analyses constructed for this In- 
vestigation. A complete description of these materials will be 
given later In this chapter. None of these materials was Intended 
for use In the classroom; all the compositions upon which the 
programs were based were assigned for Individual outslde-the-classroom 
study. No experimental variables operated x^rlthln the confines of 
the classroom; that Is to say, treatment and Instruction were the 
same for all students In all three classes within the classroom. 



*The EMAT (Elementary Music Achievement Test) authored by 
Richard Colwell and published by Follet, measures the ability 
of students to differentiate sounds according to pitch, rhythm, 
modality (melodic and harmonic) and tonality (see references. 
Item 8 at end of Chapter IV) • 



The control treatment was the conventional means which music 
departments generally use for outside-of-class listening assign- 
ments. Students had access to disk and tape recordings of the 
compositions, an orchestral score, and reference texts of differ- 
ent kinds. A common reference text used at Cortland is Moore and 
Heger The Symphony and the Symphonic Poem , published by Ulrich’s 
Books, Inc. (1966)*. Often the texts for the course have analyti- 
cal summaries of the compositions to which the students have been 
assigned. All of the conventional materials (disk and tape re- 
cordings, scores, reference books, and sound reproducing equipment) 
are located in a central room known as the music library. The 
maximum number of students which can use the facilities at one time 
is 32. 



Once an assignment is made for outside preparation, the student 
normally is expected to obtain the necessary score, recording, and 
reference literature through the use of the card catalog. From that 
po it on, he must draw upon his own resources to put meaning into 
the sounds he hears. 



The Experimental Materials . The author programed three rep- 
resentative instrumental compositions of symphonic proportions from 
the Classic period, each in a different form. The forms chosen were 
rondo, theme and variations, and sonata-allegro. Selecting for the 
experiment works of different styles or from different periods could 
possibly introduce undesirable or irrelevant variations among the 
subgroups. For this reason, each work is a single movement of a 
symphony from the Classic period. It also was tempting to consider 
the possibility that purer examples of a particular form — such as 
the rondo, for example — might be obtained from chamber music works 
(l.e., string quartets, piano trios, quintets, etc.) which would 
Illustrate various techniques better, or to Include movements or 
sections from larger compositions such as oratorios or concertl. It 
may be that such dimensions could have made for an interesting study, 
but in order to establish the controls necessary to test the hypotheses, 
the works were purposely chosen for their similarity on the basis of 
non-experlmental criteria. 

The compositions programed were as follows: a) the first move- 

ment of Mozart's AOth Symphony in g minor (sonata-allegro form), b) 
the second movement of Haydn's 94th Symphony in G Major (theme and 
variations form), and c) the fourth movement of Haydn's 101st Sym- 
phony in D Major (rondo form) . The recordings from which the audio 



*The works assigned in this experiment are all analyzed In 
Moore's and Heger's text (see references, item 9). 
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examples were taken were a) Mozart’s 40th, the Berlin Philharmonic 
with Ferenc Fricsay conducting, (Deutsche Grammophon Gesellschaft) , 
b) Haydn’s 94th, the Berlin Philharmonic Orchestra with Karl Richter 
conducting, (Deutsche Grammophon Gesellschaft) and c) Haydn’s 101st, 
the Berlin Philharmonic Orchestra with Karl Richter conducting, 
(Deutsche Grammophon Gesellschaft) . 

The programs consist of the printed (ditto-duplicated) text, the 
magnetic tape adjunct, and the orchestral score (mlnlature-sized) . 

The text Is in either three or four sections, depending upon the 
convenient lay-out of the material. In each of the three programs, 
the first section deals with a review of the particular form with- 
out direct reference to the music. For example. In the sonata- 
allegro form, the first section takes the student through the specific 
aspects of the classic structure so that he not only learns the def- 
initions of terms but also their meaning and relevance In the context 
of the entire form. 

The remaining sections of the sonata form program are built from 
the consideration of the music Itself: acquaintance with thematic 

material, the development of the material and most of the techniques 
the composer used to shape the work Into the entire artistic structure. 
One section deals with the exposition section of the work, another with 
the development and another with the closing restatement section. At 
convenient places, summaries are presented in order that the student 
can hear the smaller segments of the work In a larger perspective. At 
the beginning of that part of the program dealing directly with the 
sounds of the music and also at the end of the text, the entire move- 
ment Is played on the tape. Major parts of the movement (i.e., the 
coda, a single variation, the development section, etc.) are also 
treated In the same way in order to bring the student from an acquain- 
tance and understanding of smaller segments to the realization of the 
entire sub-structure. Thus, the general plan of each program Is a 
synthesis-analysis-synthesis approach . 

At no place In the program Is the student required to under- 
stand the score in order to grasp the significance of the building 
process. Whenever a reference Is made to the music, the sound of 
the music is the stimulus rather than the notation of the sound. This 
Is accomplished by presenting excerpts of the music either by re- 
cording the appropriate section from the record or by means of a 
piano excerpt. A piano excerpt Is Inserted only when the music is 
sufficiently complex to warrant an isolation of the relevant thread 
of sound under examination. At times it becomes necessary - on the 
tape - to give verbal cues to alert students to the specific dimension 
of the excerpt to which attention is called. 

In every instance when reference Is made to an excerpt of the 
sound of the music, a line score of the recorded excerpt is included 



in the frame. This is done to help students become acquainted simul- 
taneously with the symbolic representation of the sound. However, 
no demand is made of the student to master the score. Each of the 
sounds which is programed on each frame is also outlined for the 
student in the orchestra score. Therefore, reference to the sound 
is made in three places; first, and most important, the recorded 
sound on the tape; second, on the line score found on the frame 
Itself; third, in the orchestra score. In order that the student 
can refer quickly and easily to the orchestra score, each measure 
in the score is numbered and sections of the score referred to by 
measure numbers. 

Directions in each of the programed texts point out to the stu- 
dents that it is perfectly all right to repeat recorded excerpts as 
often as they wish to do so in order to be sure the excerpt has the 
Impact it is Intended to make. This is «n Important aspect of the 
programs* value because listening habits of many individuals are 
geared to the custom of one listening or, at best, repeated auditions 
of the whole work. This is in contrast to study habits of students 
using verbal stimuli; students are accustomed to re-reading sentences 
or even phrases. The structure of the program using taped excerpts 
makes this procedure just as feasible using sound stimuli as it is 
with verbal cues. Manipulating tape for repetition is a much simpler 
task than re-setting the stylus on a disk recording and the probability 
of injuring the equipment is much less. Hence, tape equipment en- 
courages the student to audit short sections or excerpts repeatedly 
to perceive the finer details of the music’s structure. The author 
believes this pattern of learning to listen to music is as effective 
as that of the performer who is learning to reproduce a musical work. 

In the latter situation, a serious student will often rehearse very 
short portions of the composition in order to perfect it; it is con- 
sidered generally to be a poor learning technique simply to repeat 
the entire selection. 

Eight copies of each of the three programs (including tapes and 
scores) were made available for the subjects in the experimental groups. 
As will be explained in detail later, the students in each of the 
three classes were assigned different compositions during the same 
periods so that it was considered sufficient to provide eight copies 
of each program. 

Each program was pretested on small samples of individuals to pro- 
vide information useful in editing and revising the texts and the taped 
references. Responses to the frames were checked in order to keep the 
error rate to a level of ten percent or less. 



Procedure . The schematic below (see Figure 1) summarizes the 
differences in treatments among the subgroups for each class. As 
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Group 

Class Within 
Class 




Time Period (order of events) 


1 


2 


3 


4 




OoRa 


EiOi 


E2O2 


C3O3 


Oi *02 '03 *04 




OoRb 


ElOl 


C2O2 


E3O3 


0i»02 '03»04 




OqRc 


ElOl 


C2O2 


C3O3 


0 i» 02 ' 03»04 


I 


OoRd 


CiOl 


E2O2 


C3O3 


0 i» 02 ' 03»04 




OoRe 


CiOi 


C2O2 


E3O3 


0 i» 02 ' 03'04 




OoRf 


CiOi 


E2O2 


E3O3 


0 i» 02 ' 03»04 




OoRg 


CiOi 


C2O2 


C3O3 


Oi» 02 ' 03»04 




0o®h 


E202 


E3O3 


CiOi 


O2 ' O3 » » O4 




OoRi 


E202 


C3O3 


ElOi 


O2 • O3 » 0i » O4 




OoRj 


E202 


C3O3 


CiOi 


O2 ' O3 » 0 ]^ » O4 


II 


®oRk 


C202 


E3O3 


CiOi 


O2 *03 »0i *04 




OoRi 


C202 


C3O3 


EiOi 


02*03 »0i»04 




®oRm 


C202 


E3O3 


EiOi 


02 » 03 » 0 i »04 




OoRn 


C202 


C3O3 


CiOi 


02 * 03 » 0 i «04 




®oRo 


E303 


EiOi 


C202 


03*01 »02'04 




OoRp 


E303 


CiOi 


E2U2 


03 » 0 i» 02'04 




OoRq 


E303 


CiOi 


C202 


03 » 0 i» 02'04 


III 


OoRr 


C303 


EiOi 


C202 


03 » 0 i» 02'04 




OqRs 


C303 


CiOi 


E202 


03 » 0 i» 02»04 




OoRt 


C303 


EiOi 


E202 


03*01 *02 'O4 




OqRu 


C303 


CiOi 


C202 


03*01*02*04 



Figure 1 

Schematic of Experimental Design 



(key for symbols on next page) 



Key for Symbols in Figure 1 



Ra.»»Ru* Randomly assigned groups within each class. 

E; Experimental treatment consisting of adjunct linearly 
constructed program with inserted magnetic tape 
musical references. 

C: Control treatment included in the experiment under 

the same conditions as the experimental treatment 
except for the adjunct programs. 

0: Observation on a dependent variable. 

1, 2, 3, 1*, 2 \ 3*, 4 subscript index numbers refer to the 
specific content of the observation, (o is a test to ascertain 
extent of musical background), 1 is the rondo design selection 
described in the experimental materials; 2 is the theme and 
variations design selection described in the experimental 
materials:: 3 is the sonata-allegro design selection described 
in the experimental materials; 1*, 2’ and 3' are the ’transfer 
selections" in the respective designs; and 4 is paired- 
comparison instrument to measure preference for "absolute" 
vs. "programmatic" music. 




will be pointed out in greater detail later, this arrangement made it 
possible for the Investigator to test a particular hypothesis in sev- 
eral ways in a given class as well as to replicate the observations 
in each of the other two classes. 

Students were randomly assigned into one of six experimental 
subgroups or the large "general control" subgroup. The random sorting 
of students did not take place until after the administration of the 
Colwell EIIAT test, x^hich provided the data upon xi^hich the blocking 
was based. In this way, it is possible to say that the seven sub- 
groups were essentially equivalent on skills as measured by the 
Ei-IAT. Final assignment of students into the subgroups xfl’as not made 
ui»til the last day of "drop— and— add" in the terra in order to minimize 
the probability that the selected students would drop the course.* 

Twelve tape transport stations were placed in a separate room on 
the same floor on which the listening room used regularly by the music 
departDient x*7as located. All students assigned to the programed materials 
for study were requested to use this facility. During the interval of 
time the experimental treatment was in progress, student supervisors 
were assigned to this room as well as to the regular listening room. 

The duties of the room supervisors were as follows: a)verify that the 
students x<rho used the "experimental" or "control" materials were Indeed 
assigned to be there, b) distribute necessary materials, c) distribute 
and collect time sheets so that a record could be made of the ai-ount 
of time each student spent listening. This system provided an adequate 
control so that it x^as virtually impossible for students to have ac- 
cess to the "xTTong" teaching materials. 

For each of the three compositions assigned for outside listening 
during the differential treatment all students received a copy of an 
assignment sheet, which vras distributed in the classroom. (See Appendix 
B for a copy of this form*) Complete directions xi 7 ere Included on the 
form together x^ith the names of the students who were assigned to the 
experimental materials . 

In order to accommodate the students, both listening areas** were 
manned by supervisors for a total of 65 hours each week. The only day 
during the week that the centers were closed was Sunday. It x^as dis- 
covered that this schedule provided ample time for the students' prepar- 



*The experimental subgroups remained intact except for two students 
who dropped out during the last month of the term because of illness. 

**The room in which the experimental materials and equipment were 
housed was referred to as the Tape Listening Room; the regular listen- 
ing room was called the Record Listening Room. 
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ations for the criterion tests, i^o limit was placed on any individual's 
listening time in either facility. Although students were urged to 
complete their assignments, the author felt it inadvisable to insist 
on completion or bring pressure to bear on students in either the ex- 
perimental or control groups to study the assignment. Occasionally, 
students were reminded to study for the tests, particularly when it 
appeared that failure to report to either center was due to an over- 
sight or to other factors not associated with a willful absence. Data 
from the time sheets revealed the fact that a high percentage of stu- 
dents participated, as expected. The mean number of minutes that each 
experimental group spent on the programs is tabled in Chapter. Ill (see 
Table 10). Note that the mean number of minutes spent studying each of 
the programs decreased with each succeeding assignment within each class . 
The largest single contributor to this phenomenon probably is the fact 
that, as the semester wore on, the pressure of other school work con- 
tinued to demand a larger share of the students’ study time. Note also 
that the gradient of the decrease is approximately the same for all 
classes. 

For listening experiences assigned by the instructors, students 
used only the facility housing the conventional materials. There 
was no attempt to keep data on listening time for these assignments. 

One month prior to the end of the term all students were asked 
to listen to three programmatic works also of symphonic proportion. 

These compositions were the orchestral suite Scheherazade by Rimsky- 
Korsakov (The Young Prince and the Young Princess) , the symphonic poem 
Die Iloldau by Smetana, and the concert overture Romeo and Juliet by 
Tchaikowsky, Multiple copies of the recording and the orchestral score 
were placed in the record listening room; there was no attempt made 
to differentiate assignment preparation. 

It will be recalled that the purpose for this procedure was to 
test the hypothesis that students who studied the symphonic movements, 
categorized as absolute music, by means of the experimental materials 
would be different than the other students in their preference rank- 
ings of absolute music compared to programmatic music. It was hypothe- 
sized that the "experimental*' students v 7 ould show a greater preference 
for absolute music than the "control" students. 

The assignment sheet specified that the students were to be tested 
on each of these compositions during the last part of the terra. However, 
class schedules became too crowded to permit the investigator to carry 
out this plan. 



Time Schedule. As will be noted from the calendar for the spring 
semester in Appendix B (page 127), the first assignment was not made until 
nearly three weeks of the term had elapsed. This was done in order to 
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give the Instructors sufficient time to reviev/ the elements of music 
(uhich was the subject of the preceding course) and to introduce gen- 
eral concepts dealing with form. 

The interval of time between each of the three assignments and 
the subsequent test based on each assignment was approximately ten 
days. This interval of time was felt to be necessary in order to 
give students sufficient time to listen as well as to maintain an 
optimal amount of continuity for the professor’s Instructional pro- 
gram. The intervals between each of the three follow-up achievement 
tests - for which no preparation was necessary - was one week; this 
aspect of the testing was carried on in the last month of the semester. 



The Criteria 



General restrictions, each of which shaped the criteria to some 
extent, are as follows: a) each complete test had to be administered 

in a single class period of fifty minutes, b) the major portion of 
each achievement test had to be limited to measures of aural acuity 
in order to test the hypotheses, c) the source of verbal and aural 
stimuli was limited to the composition studied, d) a scoring system 
yielding data vdiich could be handled quantitatively was desirable, 
and e) skills had to be sampled in such a x^ay that the experimental 
group x^jrould not be favored; i.e., the specific references made in the 
experimental materials should be avoided. 



The Achievement Tests for the Assigned Works. , The tests which 
measured the students' achievement on the assigned compositions were 
administered in the first three time periods of the experiment. (Please 
see Figure 1 on page IZ) These instruments were somewhat different than 
the follow-up tests, which were used to measure the amount of transfer 
in the latter three time periods; these x^rill be described in detail below. 

Each test was constructed to measure five skills; thus, a measure 
was obtained for each of five groups (or sub tests) of items as well as 
the total score. The investigator grouped items into the following 
categories : 

1. Riiythm . The ability to hear basic rhythmic and metric 
patterns and deviations from then. 

2. Helody . The ability to perceive and identify melodic 
subjects as thematic material, connecting passages, new 
material and relationships of melodic patterns to each 
other. 



3. Harmony and tonality . The ability to perceive harmonic modi- 
fications and variations and to sense tonality and 
departures from tonality reference points. 

4. Timbre . The ability to identify tonal characteristics of 
choirs of instruments and tonal variations in a sin<>le 
instrument or groups of same instruments (e.g., the first 
violin section) . This skill may be considered synonoraous 
with awareness of the dimensions of orchestration. 

5. Knowledge of general form . The ability to identify large 
patterns of a work and the relationship of smaller units 
to the whole. This skill includes a demonstration of 
knowledge of the particular form under study without 
aural reference to the music. 

Items were not clustered within the test by subtest classifications 
a more effective measurement procedure seemed to be served by ordering 
the items according to the kind of task presented at a given moment. 
However, each sub test contained ten items so that each skill was sampled 
the same number of times. 

As the construction of the measuring instruments progressed during 
the trial period, certain basic features assumed importance: 

1. Tlie test should present the stimulus material in its original 
setting. To simplify the stimuli by "masking out" the ac- 
companying sounds would reduce the validity of the test be- 
cause the testing situation is then different than the sit- 
uation to which the generalization is being made. 

2, Problems should be in the form of items which are presented 
at the time that the audio stimulus is heard. To wait until 
later would introduce the variable of tonal memory^ the study 
is not concerned with this aspect of listening ability and no 
generalizations are intended to be drawn concerning it. 

3. Within reasonable limits, items should deal with a single as- 
pect of the many sounds which the subjects hear within the 
total configuration of the reference. In this way, it is 
possible to dravj conclusions concerning variation among tlie 
subjects* subskills, 

4, The intervals of time consumed in reading and responding to 
verbal cues had to be short. If this could not be done, the 
amount of time devoted to reading and listening would reduce 
the size of the sample of items and the reliability of the test 
would be correspondingly reduced. Because the criterion tests 
had to be administered in the space of one class period, this 
was a vital factor. 
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5. In items containinjf both verbal and audio cues, tue verbal 
stimuli should be stated as simply and concisely as possible 
because the audio cues normally were the primary stimuli. 

6. The length of the audio cue had to be detexiolned in part 
by the complexity of the skill being sampled. Another 
determiner was to select only the part of the work upon 
which the task was based; adding more conceivably could 
have confused the student. 

7. Every effort had to be made to ensure that the student 
clearly understood the specific aspect of the task. Tills 
was a critical factor especially when audio cues were 
presented. 

8. In most cases, the audio cues had to be repeated at least 
once in order that students could review their responses. 

This is consistent with normal testing procedure par- 
ticularly when items are composed entirely of written 
verbal material. In the latter situation, subjects are 
in a position to reread the items. The validity of the 
test should be Increased as a result of this technique 
because what is under examination is accuracy of perception 
rather than speed of response. Finally, repetition of cues 
probably eases the tension which usually obtains in this 
kind of a testing situation. 

Taped references made from the recordings were used as audio 
cues. Ho problem involving aural discrimination was presented with- 
out also first presenting the relevant excerpt or excerpts from 
the composltlou. 

Oral directions necessary to the administration of the test parts 
dealing with the musical references were also placed on the tape. The 
musical references were spaced on the tape at Intervals of tirae consid- 
ered to be sufficient for the response and review. In this way, it was 
possible to maintain consistency of administration among the three classes. 
Once the tape was started, it was unnecessary to stop it until the testin'* 
was completed. 

Questions dealing with a specific response to a musical reference 
were usually constructed in the form of alternate response items. This 
permitted the student to read the verbal cues quickly and released the 
major .amount of his time for concentration on the audio cue. Multiple 
choice items would have detracted the student’s attention from the audio 
cues because of the Increased amount of reading time involved and also 
because more repetitions of the musical excerpts probably would have been 
necessary. Subjects were requested to respond to each of these statements 
with one of the following response codes: 1) confident it is true, 2) 

possibly true, 3) no idea, 4) possibly false, 5) confident it is false. 



Hueller (1956)^® and Ebel (1965)^^ demonstrate the usefulness of 
this technique of confidence weighting in similar situations. 

It should be stated that the short musical references presented 
in the tests do not represent the initial contact the student has 
had V7ith the music. All students had approximately ten days of out- 
side preparation time and hence the musical stimuli should have been 
familiar to their ears. In a sense, therefore, the music references 
are made for the purpose of recalling a section of the composition 
with which they v/ere supposedly already acquainted. 

Approximately two-thirds of the items x/hich called for aural dis- 
criminations were constructed so that two excerpts from the composition 
were played in sequence rather than a single phrase or section. Tlie 
decision to structure the music stimuli in a ’comparative" fashion 
was made after early trials of the testing procedure revealed tiiat 
this increased the likelihood that verbal reference to the music x 7 ould 
take on more meaning. That is, it appeared that the technique of com- 
parison was a m^ans of increasing the reliability and validity of the 
test. A typical item of this kind would read "The tones in the descend- 
ing scale pattern of the first excerpt are in the same rhythm as those 
of the scale patterns of the second excerpt." Trial administrations 
of these kinds of items showed that this method was helpful in isolating 
the desired strands of sound or focusing upon a particular characteristic 
of the musical stimulus. 

The scoring system using the confidence x^eights for the true-false 
items \jas arranged as follox 7 s: 



For a "true" item For a "false" item 



Confident it is true 


= 5 points 


Confident it is true 


= 


1 


point 


Possibly true 


= 4 points 


Possibly true 


s 


2 


points 


No idea 


= 3 points 


No idea 


ss 


3 


points 


Possibly false 


= 2 points 


Possibly false 


ss 


2 


points 


Confident it is false 


= 1 point 


Confident it is false 


= 


5 


points 



Multiple choice items were scored according to the follox^ing plan: 

Keyed response = 5 points 
Wrong response = 1 point 
No response = 3 points 

Students xjere instructed to respond to all cues as objectively as 
possible; all options X'lere carefully explained in the directions printed 
on the test forms. Examination of a sample of the response patterns 
in each of the six tests indicated that all options were exercised by 
the students and that the frequency of non-response x^as relatively low. 



- 24 



The ''possibly true” and "possibly false” responses were used much 
more often than the non-responses, even though a majority of students 
most frequently opted the extreme ends of the choice scale. Rarely, 
however, was it observed that students never exercised alternate 
choices to a "full-credit” response. 



Each of the six achievement tests contained fifty items; hence, 
each instrument had the same theoretically possible range from a min- 
imum of 50 to a maximum of 250 points. In practice, however, the 
range was much smaller; the observed minimum total score on any given 
test was approximately 130 and the maximum total approximately 240. 
Each of the five subtests for all six tests consisted of ten items, 
or one-fifth of each possible total score (i.e., 50 points). 



The Follow-up Achievement Tests . One significantly different 
condition had to be accounted for in the construction of the follow- 
up tests (0]_» 02* 03 »)» it could be assumed that all students had 
heard the movement of the symphony prior to the test. However, in 
the follow-up tests*, this was not the case. It was deemed necessary 
for students to hear the total movement with a serious attempt made 
to preserve continuity before the test began. Due to the time re- 
striction of one fifty-minute class period for a single test, it was 
decided to follow this procedure: 

1. The first section of the test would be devoted to an 
audition of the work divided into four large sections. 

For example, in the sonata-allegro form, the sections 
were a) the exposition b) the development c) the re- 
statement d) the coda. Five items would be built to 
be answered during the first audition of each sect ion, 
making a total of twenty items (five for each section) . 
These items were very short and succintly worded; in 
early trials it was apparent that the difficulty level 
of the items was such that students could listen to 
the section of the work and react to the items with- 
out unduly disturbing their listening. Each section 
was played only once. The function of each section 
in the composition (coda, exposition, etc.) was an- 
nounced on tape previous to the audition of each 
separate section. 



*Based on Haydn's 100s t Symphony (second movement) for the rondo, 
Beethoven's Fifth Symphony (second movement) for theme and variations, 
and Beethoven's 8th Symphony (1st movement) for sonata-allegro. 
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2 . The remainder of each of the measuriiif? instrunents was simi- 
lar to the first three tests in style and forii.at. Short ex- 
cerpts provided the auditory stimulus for items and each of 
these was usually repeated once. 

3 . because students were unfamiliar with these words, it was 
expected that the grand means for each class would be lower 
than in the previous tests. This expectation was borne out. 

The content validity of the achievement tests was perforce estab- 
lished by the fact that all the items which measured aural perception 
relating to the composition x^ere taken from the composition itself. 

In other v;ords, the samplings were not only a direct measurement of 
the skill x^hich the test purported to assess but were taken from the 
same recording to which the students listened in preparation for tlieir 
tests. Furthermore, items from the tests were submitted to a jury of 
five music professors (one was the music consultant for the Investigation) 
concerning the validity of the tests with respect to their stated 
function. Finally, all the examinations were evaluated by the measure- 
ment and statistics consultant, particularly for criticism of item 
structure and format of the test forms. 

An unfortunate aspect of the study x/as that the sub tests which 
dealt with the assessment of aural perception within all six achieve- 
ment tests proved not to have as high reliability indices as had been 
hoped. Reliability Indices for the total test scores using the odd- 
even method of estimation ranged from .46 to .70 computed on randomly 
selected subgroups of students. The audio subtests x^ere even lox^er, 
ranging from a low of .13 for the melody sub tests to .42 for timbre 
sub tests. Knoxjledge of form sub tests ranged from .32 to . 62 . Tliese 
values clearly shox7 that the expected amount of variation generated 
by the tests* low internal consistency were an important factor be- 
cause this source of error x^ould reduce the sensitivity of the cri- 
teria for hypothesis testing. Tlie expectation would be that treatment 
differences reflected in the total test scores and the knox7 ledge of 
form sub test scores probably X70uld be more frequently significant 
since the reliability indices of these were generally higher. This 
proved to be the case, as is noted in the next chapter of the report. 

The melody subtests x/ere least reliable; no significant differences 
due to treatment were found using this sub test as a criterion for any 
of the comparisons. A more thorough discussion of this topic will be 
reported in the analysis chapter. 



The Elementary Music Achievement Test . Tliis instrument, which 
X7as administered at the beginning of the term, provided data which 
were used as a blocking variable in order to make the comparisons 
between treatments more powerful. Permission to modify the test was 
requested from the author because such modification would more nearly 
meet the needs of this study. The sections dealing i/ith auditory -visual 
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discrimination x;ere omitted because skill in reading the score was 
not a vital aspect of the criteria used in this investigation. 

The following sections of the EIjAT were selected as being rele- 
vant as a pretest: 

1, Pitch (two and three tone patterns) 

2, Intervals (phrases) 

3, Tonality (cadences and phrases) 

4, Ilodality (chords and phrases) 

The modified test comprised 77 items; a high score of 101 was 
possible, however, since some of the items required two and sometimes 
three separate discriminations to be made. 

The EMT was administered at the third meeting of the class. 
Students were not Informed of the purpose for the test beyond a per- 
functory explanation that the instrument was being administered to 
ascertain the general level of their achievement in aural perception. 
Raw scores were posted by test number so that a student could evaluate 
his score with respect to the class’s mean. They were informed that 
these data would have no effect on their class standings; the in- 
structors were not apprised of these data (as the investigator promised 
the subjects would be the case) . 



The Preference Inventory . To measure differences in preferences 
for absolute music compared to programmatic music, the students were 
asked to select one of two compositions in each of fifteen sets of 
paired comparisons based on the criterion of which of the two they 
liked better to listen. The pairings were composed of the first three 
symphonic movements which had been studied earlier in the semester 
( Haydn’s 101st Symphony , fourth movement; Haydn’s 94th Symphony , second 
movement; ifozart’s 40th Symphony , first movement, and the three pro- 
grammatic works assigned for study later in the term (Tschaikowsky ’s 
Romeo and Juliet ; Smetana’s Die Iloldau and Rimsky-Korsakov’s 
Scheherazade ) . These six works were arranged into the fifteen possible 
separate pairings and students were requested to make a choice of the 
one they preferred as listening fare in each case. The instrument as 
the students received it is reproduced in Appendix B (page 218). 

In order to help recall for them any specific attitude concerning 
each of the works which they might have acquired as they had studied 
the compositions, the investigator prepared a short tape including 
the themes of each. This was played immediately prior to their filling 
in the preference forms. Furthermore, by means of this technique, it 
was hoped that choices would be based more on a "real’* subjective 
reaction associated with the music instead of a snap judgement possibly 
resulting from irrelevant cues . 



Two methods of scoring the forms were employed. The first 
method consisted of tabulating x^hich composition x^as preferred in 
each of the fifteen pairs end entering the tallies in a table. 

Wlien all tallies x^ere placed on the proper cells, the frequency 
of tallies determined the ranking for any group of students or 
combination of groups (such as all experimental subjects vs. all 
the control subjects) . 

The second method was to assign a plus value of one point for 
each instance where the preferred choice was in the direction of 
the hypothesis, i.e., X7hen an absolute composition over a program- 
matic composition was selected. If the student checked the pro- 
grammatic composition, a negative value of one was tabulated. By 
this means, it was possible to obtain a single score for each 
student *s set of comparisons. In the case of the experimental 
subjects only, this method of scoring yielded tx^o scores; these 
students* preferences provided data to test both parts of the hypo- 
thesis (see the second hypothesis on page 7 ) ; 

1. The first part of the hypothesis proposes that students 
x- 7 ho studied the symphonic movements by means of the 
programed analysis would “overcome*' the oft-observed 
preference of non-music majors for programmatic music 
by demonstrating a greater frequency of preferences 
for absolute music. To test this, a plus value was 
tabulated in each comparison when the student selected 
the absolute music. 

2. The second part of the hypothesis states that the ex- 
perimental students xvould show a stronger preference 
for the compositions they had studied by means of the 
programed analysis. To test this, whenever a student 
selected the work he had studied by the programed an- 
alysis when that composition was one of the pairs, he 
was given a plus score. tJhen his selection was oppo- 
site to the hypothesized direction, a negative value 
of one was assigned to the choice. 

The rationale and assumptions underlying both aspects of the 
hypothesis are as follows: 

1. To a degree, an individual’s preferences for specific 
art objects are formed by his ability to sense the 
purpose or “meaning*' of the art. 

2. The person x^ith a musically-untrained ear strives to 
find meaning in a composition through extra-musical 
factors and associations. These may be facts about 
the music, the composer, or as in programmatic music. 



the program. An appreciation for music with which a person 
is unable to make definite extra-musical associations (such 
as in absolute music) must ultimately come primarily from 
his perception of its structure. 

3, The programed analysis used in this study xi7ill teach the 
subjects to perceive the structural design of the musical 
art form programed. 

These assumptions are not intended in any way to demean the value 
of programmatic music, of course. Neither are they intended to indi- 
cate a superiority of absolute music. Care was taken in this study 
not to imply a heirarchy of values* the compositions were referred to 
by name only (see Appendix B for preference form) . Programmatic music 
which was assigned is from the standard repertory of Western master- 
pieces*. 



The Questionnaire . No hypothesis had been stated concerning 
the attitude of the students to general or specific features of the 
experimental materials. However, as the study progressed, the 
author received several unsolicited evaluative comments from students 
from time to time and since these comments were often at variance 
one with another, it seemed advisable to make a systematic evaluation 
of these attitudes. 

A form consisting of thirteen questions was constructed which 
gave the subjects an opportunity to react to specific aspects of the 
programed texts, the tape adjunct, the use of the orchestra score and 
the equipment. The students could complete the form by checking 
listed possible responses and/or respond to open-ended choices. 
Tabulation was done by a frequency count of checked responses to each 
item on the questionnaire and classifying the kinds of remarks made 
to the open-ended questions. 

In order to avoid unnecessary bias in the responses, four pre- 
cautions were taken; a) the questionnaire was mailed to the students 



*1. Romeo and Juliet Overture (Tschaikowsky) performed by the 
New York Philharmonic Orchestra conducted by Bernstein on the 
Columbia label, 2. Scheherazade (Rimsky -Korsakov) performed by 
L'Orchestre de la Suisse Romande conducted by Ansermet on the London 
label, 3. Die Moldau (Smetana) performed by the Berlin Philharmonic 
Orchestra conducted by von Karajan on the Angel label. 



rather than asking them to complete the forms In class; b) the 
questionnaire was mailed after the term had officially ended (In 
order not to create an Impression that a favorable response was 
"diplomatic”); c) the student did not need to sign his name, unless 
he wanted to be Identified as the author.* d) the student returned 
the questionnaire by mall to the Investigator. 

The code number of each student was placed on the form before 
It was mailed to him. Tills was done for two reasons: first. In 
the event of non-response, the follow-up notices had to be sent; 
secondly. It would be of Interest to note variances X 7 hen replies 
were grouped according to blocks and treatment. 

Follow-up notices were mailed only once; a one hundred per cent 
response was realized without difficulty. The questionnaire form 
and letter of transmittal are Included In Appendix B (see page 219) . 



Controls 



There was no attempt on the part of the Investigator nor the 
three Instructors to hide the fact from the students that a system- 
atic evaluation of some kind v-7as taking place. It was felt that 
students would perform more nearly as they would In a routine class- 
room situation If they were given enough Information to maintain a 
semblance of their normal posture. In order to control for the 
Hawthorne effect during the experiment the students were told that 
an assessment of new materials was being made, with a particular 
emphasis on the tapes and tape transports. No mention was made at 
any time that the central purpose of the experiment was to assess 
the efficacy of programed Instruction as a teaching tool in learning 
to listen to a complex art form. 

The experimental loaterlals were always referred to as ”the 
tapes’* or "the taped compositions’' and the room in V7hich the ex- 
perimental group fulfilled their assignments was called the Tape 
Listening Room^. To be sure, some of the students were aware that 
they were reading a program because they had been exposed earlier to 
programed Instruction in other disciplines. There was no reinforce- 
ment of these observations, however by the Investigator or their 
professors. 



*The room in which the control groups prepared their assignments 
was called the Record Listening Room. Appropriate signs were made 
and attached to the walls near the entrances to both rooms. 



The analytical design also provided a n.easure of control of 
the iiawtiiorne effect (see Figure 2 on page 34 ) . Each of the com- 
parisons used to test the hypotheses involved only students who at 
one time or another had had an opportunity to study by means of the 
experimental materials. These students, therefore, were at one 
time "experimental" subjects and at other times, "controls". Since 
the individuals were not apprised of the hypotheses under test, it 
would have been difficult for any one subgroup to know under which 
particular circumstance they were hypothesized to do better than 
another subgroup. This was particularly true of the comparisons 
made to test the latter three hypotheses (see the listing for hypo- 
theses lb, Ic and Id). 



Summary 



This chapter has been devoted to a detailed explanation of trie 
basic features of the procedure used in conducting the experiment 
itself. A complete description of the sample of subjects chosen to 
test the hypotheses has been made as well as an outline of the basis 
on which the experimental and control groups were selected. The 
differential treatment of experimental and control subgroups within 
each was presented together \7ith a description of the experimental 
materials; the effectiveness of these instructional tools was the 
primary purpose of the study. An accounting of the experimental 
procedure used in this study and the time schedule of the major 
events in the experiment were made in the schematic showing trie 
details of its different stages. The author has also given an ex- 
planation of the construction and use of the criteria develoned for 
the experiment and finally, a description of the controls built into 
the design was made. 
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ClIi\PTER III 



THE AI^IALYSIS AND FIUDINCS 



This section of the report will explain in detail the procedures 
followed to analyze the data obtained by the criteria described in 
the preceding chapter. The primary matter of concern, of course, was 
to discover if any significant differences occurred among groups in 
terms of achievement scores on the several subtests relating to aural 
perception. Tests of Information obtained by the other data~gathering 
devices will also be of Interest; however, these findings will be of 
secondary Importance to the purpose of the study. Separate evaluations 
were made for each of the three sections enrolled in Mu 112; no attempt 
was made to group the classes for the significance tests. 



Description of Classes: EtIAT Scores 




Table 2 shows the means, medians, and quartlles computed from 
scores obtained on the Elementary Music Achievement Test. It will be 
recalled that these data were used to block the students prior to 
their assignment to the subgroups. Because the design does not call 
for a comparison of the classes, no significance tests were made to 
discover differences among them. However, clearly there is little 
variation particularly among the measures of central tendency. The 
data are derived from total scores only; because the total test was 
used as a blocking variable, no useful purpose would have been served 
if subscores had been plotted. 



Table 2 

DESCRIPTION OF ALL CLASSES 
BASED ON EIIAT DATA 



Variable 




Class 




I 

(N»124) 


II 

(NslOS) 


III 

(W=107) 


Mean 


70.4 


70.8 


69.2 


Median 


70.5 


72.0 


69.0 


Ql 


59.0 


62.0 


62.0 


Q3 


79.5 


82.0 


80.0 
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Colwell has no norms for college students so that It is im- 
possible to assess the relative standing of the entire sample on his 
aural skills sub tests* However, the celling of his test which was 
modified for this investigation, was sufficiently high to measure 
every student*. 



Analysis of Achievement Test Scores 



It will be helpful if one would refer to the treatment schematic 
(see Figure 1) as the various comparisons are discussed. Figure 2 
below lists all the comparisons which were made for each class. It 
will be remembered that each test, or observation, consists of five 
subtests as well as the total score. It was decided to test the 
significance of the total score for the purely "audio” portion of the 
test separately as well (the reason for this will be discussed later) . 
Therefore, for each comparison * a significance test was made based on 
the scores for each of seven subtests. To test all four hypotheses 

Id)* 140 significance tests were run for each section, 
or a total of 420 for all three classes. 

Important to note that the experimental variable was 
operative outside the classroom so that the students were exposed 
Indivi dually to the materials; therefore. It was assumed that the 
learning of the students was Independently achieved. Thus, within 
each class, the experimenter felt justified in considering the s tudent 
rather than the entire class to be the experimental unit. 

Scores obtained from the six achievement tests were used to test 
hypotheses I**, lb, Ic and Id (see page 7 ). Hypothesis la is of 
central Importance, of course, since the Investigator is concerned 
whether or not an Individual who studied the programed analysis of 
a particular composition earned a significantly higher score on the 
achievement test based on that composition than one who studied it 
conventionally , 

-^yPQS of comparisons for testing hypothesis la should be 
pointed out. Comparing subgroups R|j vs. on O 3 (see comparison 4 
in Figure 2) is a comparison Which is conveniently called "type 1" 
in Figure 2 - between a group which has studied one of the three 
compositions (composition 2 ) using the experimental materials 
and a group which did not use the analysis for that composition. 



*The highout lau possible was 101. The top-scoring 

etixdgat la all threa yuistlons fell short of that by two points. 



Comparisons* 


Observation 


Relevant 

Hypothesis 


!• vs* (R^j4*Rg+R£) 


0 l 


la (Type 2) 


2. 1/2 (R^+Rf) vs. Rg 


O2 


la (Type 2) 


3. Rg vs, 1/2 (Rb'*'*^c^ 


O2 


la (Type 2) 


4. Rb vs. Rg 


O 3 


la (Type 1) 


5* Rf vs* R^ 


O 3 


la (Type 2) 


6 . Rb vs. Rg 


O 3 


la (Type 2) 


7. Rg vs. Rb 


02 » 


lb 


8 . Rb vs. Rg 


031 


lb 


9. Rg vs. Rg 


^3 


Ic 


10 . 1/2 (Rg+Rb) vs. Rg 


Oi. 


Ic 


11. R£ vs. R^ 


Or 


Ic 


12 * Rb vs. Rg 


02 » 


Ic 


13. Rf vs. R^ 


02* 


Ic 


14. Rg vs. Rg 


03I 


Ic 


15. Rg vs. R^ 


03 


Id 


16. Rb vs. Rf 


03 


Id 


17. Rg vs. Rf 


02 * 


Id 


18. R^ vs. Rg 


02* 


Id 


19. Rb vs. Rf 


03* 


Id 


20 . Rg vs* R^ 


Oy 


Id 



Figure 2 

Comparisons Made to Test Hypotheses la, lb, Ic, and Id 
(Subgroups Shown for Class I Only) 



*The left term of each comparison 
was hypothesized to be greater. 
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In addition, by the time O3 Is made, both and studied the same 
number of compositions (two) using experimental materials during the 
same Interval of time In the semester and both had the first oppor- 
tunity to study by means of programed analyses (for class I shown In 
Figure 2, E3) . Unfortunately, only one of this type of comparison Is 
found In each of the three classes to test hypothesis la. However, 
the other five comparisons which test this hypothesis (shown as "type 2" 
comparisons In Figure 2) also answer the question whether the use of 
the programs made a difference with respect to the criteria. However, 
unlike the "type 1" comparisons In these situations It can't be deter- 
mined whether any program would have been effective or whether It was 
the specific program Itself. 

Excluding group Rg, the comparisons to be made for testing 
hypotheses la, lb, Ic and Id are shown In Figure 2 for Class I. 

Analogous comparisons were made for Classes II and III except that the 
permutations of programs and observations differ as shown In the 
treatment schematic in Figure 1. The left side of each comparison 
In Figure 2 was hypothesized to be greater. 

The experimental design Is orgarized In such a way that all the 
comparisons listed In Figure 2 were made so that the variation due 
to the effects possible from conditions stated in any or all of the 
other three hypotheses can be discounted. For example, consider 
comparison 20: vs. on O3' . Neither R^. nor studied the 

sonata-allegro design (the sonata-allegro design Is the design measured 
by O3) using the experimental materials. Further, both R^ and R^ 
studied the same number (one) of compositions using the experimental 
materials. They differ only in the characteristic under investigation 
stated by hypothesis 1^; namely, subgroups had access to the pro- 
gramed material earlier In the term than Rd and Is thus hypothesized 
to perform better on O3'. 

For each sub test of each achievement test, a two-way analysis 
of variance was made. Treatment (R^, Ri>, etc.) and score on the EMAT 
specified by block were the Independent variables*. 

The Test of Hypothesis la . Figure 2 shows that six comparisons 
(numbers 1 through 6’).vwer^' made among certain subgroups to test the 
first hypothesis. The hypothesis states: all things being equal, a 

student's perceptive skills relating to the components of structure 
(form) in a musical work will be greater when the student studied the 
composition under examination using an adjunct linearly constructed 



*Since the variances due to the Interaction of treatment and block 
were nonsignificant In all cases, the within cell and Interaction mean 
squares were pooled for the purpose of testln*.' the significance of 
the differences among treatment means. 




program with inserted tape musical references. This is the 
central hypothesi«=? of the study; if no differences could be 
found at this point, one could hardly expect differences to be 
discovered in the situations described by hypotheses lb, Ic 
and Id. 



Table 3 



NUMBER OF SIGNIFICANT DIFFERENCES FOUND BY THE 
TESTS OF HYPOTHESIS la BET17EEN THE RELEVANT 
CONTROL AND EXPERIMENTAL SUBGROUPS GROUPED 
BY COIff ARISON (ALL CLASSES) ^ 



Comparison 

Number 








Sub test 






H&T^ 


K of F^ 


Mel^ 


Rhy^ 


Tim® 


Total 

Score 


TOTALS 


1 


1 


3 


0 


0 


0 


2 


6 


2 


2 


2 


0 


2 


0 


2 


8 


3 


0 


2 


0 


1 


0 


1 


4 


4 


1 


2 


0 


0 


1 


2 


6 


5 


1 


1 


0 


1 


0 


2 


6 


6 


0 


2 


0 


0 


0 


1 


3 


TOTALS 


5 


12 


0 


4 


1 


10 


33 



^Harmony and Tonality, ^Knowledge of Form, 

^Melody, ‘^Rhythm, ®Timbre. 

Maximum number of significant differences 
possible for each entry is 3 (comparison times 
number of sections). Thus, for the subtest Form 
in comparison 1, the differences were significant 
every time. 



As will be noted later, the planned comparisons to test this 
hypothesis yielded more significant differences to substantiate 
vit than for the other hypotheses. Comparisons for all subtests 
In all three classes showed 33 significant differences (at the 
5 per cent level) in favor of the relevant experimental subgroups. 
In each of these 33 Instances, the null hypothesis that 
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the differences found between the experimental and control subgroups 
could have arisen due to chance factors was rejected. 

The usefulness of the experimental design becomes apparent 
when the six different means of testing this single hypothesis are 
noted (please refer to Figures 1 and 2). Had the experiment been 
limited to a simple comparison of experimental versus control on 
one criterion using one programed analysis, conclusions concerning 
the hypothesis with these data would have to be approached with 
greater caution. However, as the summary of all comparisons for 
the test of hypothesis la In Table 3 shows, the significance test 
made for each sub test, with the exception of the melody sub test*, 
results In at least one significant difference In the hypothesized 
direction. Table 4 clearly establishes the fact that these results 
are found In all three classes (data from Class III are not nearly 
so clear-cut; a discussion of the reasons for this difference will 
be made later In this section). The data in Table 3 show also that 
each of the six methods (or comparisons) of testing the hypothesis 
yields similar results**. Finally, the achievement tests built on 
each of the three compositions which were programed all resulted In some 
significant differences between the relevant experimental subgroups 
and relevant control subgroups; these data are listed In Table 5. 

The raw score values expressed In means are Included In Tables 15, 19, 
23, 27, 31, 35 and 39 (see Appendix A) grouped bywfsubtest. 

The investigator feels that the frequency of significant 
differences found In favor of the experimental subgroups provides 
sufficient experimental evidence to support the validity of 
hypothesis la. This Is particularly true with respect to the 
following sub tests: a) Harmony and Tonality, b) Knowledge of Form, 

c) Rhythm, and d) the total test. No conclusions can be drawn con- 
cerning the perception of melody and timbre. 



*The melody sub tests In all of the six achievement tests x^ere 
the only subtests which consistently showed no significant differences 
between relevant subgroups for all four hypotheses. One of the main 
reasons for this phenomenon probably Is that the melody sub tests had 
extremely low reliability Indices (see previous section) . 

**The tables summarizing the tests of hypotheses la, lb, Ic, and 
Id do not Include references to comparisons on the "audio" portion 
of the test scores. Although the audio scores were used In testing 
tne hypotheses, they were not an Integral part of the original 
design. References are made to the audio score in the text of 
the report and means for all subgroups on the audio portion of all 
six tests are Included In Appendix A. 
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Table 4 



NUMBER OF SIGNIFICANT DIFFERENCES FOUND BY THE 
TESTS OF HYPOTHESIS la BETWEEN THE RELEVANT 
CONTROL AND EXPERIMENTAL SUBGROUPS GROUPED 
BY CLASS (ALL CLASSES) 











Sub test 






Class* 


H&T 


K 

of F 


Mel 


Rhy 


Tim 


Total 

Score 


TOTALS 


I 


3 


4 


0 


3 


0 


4 


15 


II 


1 


5 


0 


1 


1 


C 


13 


III 


1 


3 


0 


0 


0 


1 


5 


TOTALS 


5 


12 


0 


• 4 


1 


10 


33 






*6 comparisons 


for 


each class 





Table 5 



NUMBER OF SIGNIFICANT DIFFERENCES FOUND BY THE 
TESTS OF HYPOTHESIS la BETWEEN THE RELEVANT 
CONTROL AND EXPERIMENTAL SUBGROUPS GROUPED 
BY COMPOSITION STUDIED (ALL CLASSES) 



Sub test 



Composition* 


H&T 


K 

of F 


Mel 


Rhy 


Tim 


Total 

Score 


TOTALS 


Haydn’s lOlst 
(rondo) 


0 


6 


0 


0 


1 


5 


12 


Haydn’s 94th 
(variations) 


1 


2 


0 


2 


0 


2 


7 


Mozart's 40th 
(sonata) 


4 


4 


0 


2 


0 


3 


14 


TOTALS 


5 


12 


0 


4 


1 


10 


33 



*6 comparisons made for each composition 
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Tables 15, 19, 23, 27, 31, 35 and 39 in Appendix A shov7 
that the means differed between the relevant experimental and 
control groups in the expected direction in 96 of the 126 possible 
comparisons. The number of times that the means of the experimental 
groups exceeded the means of the general control groups was 103 of 
the 126 comparisons. 

Some relevant observations should be made in view of the in- 
consistency of some of the findings. It is clear that the low 
reliability of the criterion measure - particularly so of the 
audio subtests - contributed to these circumstances. That is, 
the rimount of error due to the low internal consistency of the 
sub tests probably was a major factor in many instances where ex- 
pected significant differences did not materialize. 

It seems noteworthy to comment on probable reasons for the 
low reliability of the audio sub tests*. First, a sampling of 
ten items of aural skills (which have been observed by other 
researchers to be difficult to measure accurately) was apparently 
too small. An attempt to discover the optimum number of these 
particular kinds of samplings necessary in order to attain a 
desired level of reliability would be the subject for an inter- 
esting study. 

Second, measuring an individual's perception of sounds as 
these occur vrithin the musical work has been observed by music 
educators to be an inherently difficult procedure. It is easier 
to measure differences of aural acuity relating to a sound 
without reference to the time base upon which sounds are modu- 
lated for "meaning". For example, requiring the perception of 
the difference between two sounds of isolated pitches or rhythm 
patterns is an easier task to measure reliably than to assess 
similar skills of an individual when the same sounds are fitted 
within the context of an art form. Even a very short lapse of 
one's attention may result in a degree of loss unrelated to his 
true ability; once the sound has passed, it is irretrievable. 

This is entirely unlike a printed symbol to which one can return 
again and again for the cue. 



*Early trials and expectations led the author to anticipate 
that the indices might not be very high because of the restric- 
tions noted in the previous chapter. While it is likely that 
the experimental situation itself generated some of the undesirable 
variation however, the tests were expected to function more 
efficiently than was the case. 
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Third, the response being measured nearly always relates 
to a single dimension of a sound which normally is nested within 
a complex of other sounds. It is reasonable to expect that an 
individual's attention could be attracted to the other "mis- 
leading" or "irrelevant" cues. It is also possible that these 
contiguous sounds may even obliterate (for him) the cue to which 
the test statement makes reference. Responses of this klnd^ 
which are the result of Intricate interaction of cues with the 
subject, probably are the largest single source of error in 
measurements of listening ability. 

Attention should also be called in this discussion to the 
knowledge of form subtest. Since this sub test consists primarily 
of items which deal more often with reasoning power (rather than 
aural perception) of the student, it is interesting to note that 
significant differences in the hypothesized direction are noted 
more frequently than in the other sub tests. Tables 3, 4, and 5 
show that in one comparison the relevant experimental subgroups 
in all classes were significantly better than the corresponding 
relevant control subgroups. In four of the five remaining com- 
parisons, note that two-thirds of the time the differences x^ere 
also statistically significant in favor of the relevant experi- 
mental subgroups. Either the programed material was effect! a 
in teaching these particular skills more so than the other skills 
or (which seems more likely) the error in this subtest resulting 
from inconsistencies of measurement was lower and thus revealed 
differences more readily. 

The "audio score" was derived by totaling the responses of 
students to items relating to aural perception only, and vjas 
computed simply by subtracting the knowledge of form score frora 
the total score. This was done in an attempt to discover whether 
or not differences would be noted between relevant subgroups on 
that part of each test eliciting responses to aural stimuli only. 
Unfortunately, the significance tests do not justify a firm con- 
clusion one way or the other even though the means for the 
relevant experimental subgroups were greater in every case than 
for the lelevant control groups in Classes I and II (one differ- 
ence was significantly different in the hypothesized direction) . 

In Class III, this difference was not so clearly defined; in only 
three of the six comparisons, the differences were in the hypo- 
thesized direction. A likely conclusion from this finding is 
that the knowledge of form sub test is the main contributor to 
the reliability of the total test. 

The findings for Class III, which are most clearly seen in 
Table 4, are puzzling in viex-/ of the fact that the data obtained 
in the other two classes do not vary to such a great extent. 

This difference is seen most markedly in the number of significant 



differences between relevant subgroups for total scores (see 
Table 4) . The writer has attempted to make inferences con- 
cerning possible causes for this phenomenon. Since there was 
no control exerted over such factors as selection of the class 
sample, the course outline used, textbooks, hour of meeting. 
Instructor, selection of learning experiences in class, one may 
conclude that some variation could have been generated from any 
or all such sources. 

Of variables which might have contributed most to the 
divergence observed in Class III the following differences 
seem to the investigator the most likely causes: 

1. The instructor of Class 111 was concerned with the 
amount of time taken from his class for gathering 
data to test the effects of the experimental data 
so that he required all his students to attend an 
extra ‘'listening-discussion" hour every week to 
compensate for this loss of time. It is Important 
to note that the other instructors did not make 
such a change in their class schedules. In 
several discussions with this instructor, the 
researcher learned that the main objectives of the 
extra required class hour were to teach the stu- 
dents to listen more intelligently and to learn to 
make Increasingly find discrimination among sounds. 

The instructor of Class III reported that the extra 
listening session was, in his opinion, very effective. 

2. Students in Class III were required to demonstrate 
their progress during the semester by means of six 
achievement examinations, each one of which was 
primarily a listening test. Not only was each of 
these a learning experience, but close observation 
by the investigator in the listening room revealed 
a much greater amount of activity by students in 
Class III than in the other sections (which is to 
be expected, of course). In comparison, students 
in Classes I and II were required to take only two 
achievement tests during the semester. Thus, Class 
III had three times as many learning experiences 
through preparation for their tests as well as 
during the testing periods themselves. 

3. The instructor of Class III also used the "experi- 
mental" composition* which was under test for hypothesis 



*The first movement of Mozart’s 40th Symphony. 
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la in his section as an example of sonata form. For this 
purpose, he charted the entire movement, and duplicated 
and distributed the chart for students in his class*. 

The points above are made simply to explain why results be- 
tween Class III and the other classes differed so markedly. Clearly 
the data had become contaminated and the author cites the three 
items listed above to be the primary causes. 

Finally, the author made a simple count of the number of times 
that the means of the experimental subgroups in each class exceeded 
the means of the relevant control and general control subgroups**. 

Of the 84 comparisons made for the test of hypothesis la in classes 
I and II only, the experimental subgroups’ means were larger than the 
relevant control subgroups’ means in 72 cases. Again in classes I 
and II only, the experimental subgroups’ means exceeded the general 
control groups’ means in 83 of the 84 comparisons. When Class III 
is included, these high proportions drop somev'^hat, of course. 

However, the figures are still impressive 2 the experimental subgroups 
means are greater than the relevant control subgroups ' means in 96 
of the 126 comparisons and in 103 of the 126 comparisons made be- 
tween the experimental subgroups and the general control subgroups. 

To summarize, in vievj of the difficulties encountered by the 
problems of measuring the criterion behavior accurately, and the 
atypical activity of Class III, the frequency of significant differ- 
ences found by testing hypothesis la seems to justify the conclusion 
that the experimental treatment was effective. In other words, it 
does appear that the individuals t^ho studied the compositions by 
means of the programed analyses did significantly better than 
students who studied the same compositions with conventional 
materials and equipment on the basis of the criteria. 



The Test of Hypothesis lb . The rationale which underlies the 
basis for hypothesis lb is that it would be reasonable to expect 



*This was done, unfortunately, in spite of the fact that the 
researcher had specifically requested that the six compositions 
used in the experiment not be used in the classroom nor for outside 
preparation. 

**Recall that students in the "general control" subgroups were 
never exposed to the experimental materials. On the other hand, 
students in the "relevant control" subgroups did have access to the 
programed analyses. The latter groups are "controls" only in the 
sense that, in specific comparisons, particular variables (as des- 
cribed by each of the four hypotheses) were held constant by means 
of these subgroups. 



students to transfer more perceptive skill to a composition not 
studied which is in the same design as one studied by a programed 
analysis than a student who had also studied, equally early in the 
semester, a composition by a program but not in the same form. In 
other xi/ords, the test here is to determine the effect of an in“ 
dividual's familiarity with a composition's basic design on the 
amount of aural transfer. Again, the usefulness of the experimental 
design is readily apparent; not only can the transfer power per se 
of the programs* teaching be assessed, but it permits an inference 
concerning the effect of a musical factor (in this case, form) upon 
the outcome. 

Only two comparisons v;ithin any given class could be made to 
test this hypothesis and still control the other factors under 
consideration. Taking comparison 7 in Class I as the example (see 
Figures 1 and 2), Ra vs, Rb on O 2 ' is an opportunity to study the 
effect of the situation hypothesized, for both subgroups are no 
different in other respects: a) both subgroups had the first op- 

portunity to study a program in the term, b) both subgroups studied 
the same number of programs (two). The same situation holds true 
for the eighth comparison, vs. R^, except that now the test must 
be O 31 . The same kinds of comparisons were possible in each of the 
other tv 7 o sections, but of course, the observations (tests) are 
different in each case because the order of observations was 
different (see Figure 1) , 

None of the fourteen sub test-comparisons in Class I yielded 
significant differences, and only two in each of the other classes 
were significantly different in favor of the relevant experimental 
subgroups. Table 6 summarizes these data. 

Since such a small proportion of the comparisons produced 
significant differences, one would hardly expect to find any 
patterns from which one could draw conclusions concerning the 
hypothesis. Because only four of the 42 comparisons tested were 
significantly different, it is probably safe to fai .1 to reject the 
null hypothesis that differences were due to chance. 

Again it appears that the central problem is the low reliability 
of the subtests. An examination of the raw data in Tables 16, 20, 24, 
28, 32, 36 and 40 (Appendix A) shows that in 28 of the 42 comparisons 
the mean values of the relevant experimental subgroups were in the 
hypothesized direction when compared to the relevant control subgroups. 
In comparing the experimental subgroups to the general control sub- 
groups, 35 of the 42 differences between the means were in the hypo- 
thesized direction. Therefore, it is tempting to suggest that, with 
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more reliable tests, statistically significant differences might 
have been noticed more frequently*. 



Table 6 

SPECIFIC SUBTEST-COIIPARISOIJS YIELDING SIGNIFICANT 
DIFFERENCES TO SUPPORT HYPOTHESIS lb 



Class 


Comparison 


Sub test 


Composition 


II 


0 


Knowledge of Form 


Haydn’s 100th 


II 


3 


Timbre 


Haydn’s 100th 


III 


a 


Harmony & Tonality 


Beethoven's 5tli 


III 


7 


Total Test 


Haydn’s 100th 



The Test of Hypothesis Ic . The purpose of this evaluation is 
to measure the effectiveness of the programed analyses with respect 
to the number of programs studied. It seems likely that students who 
were randomly selected to study two compositions by means of the ex- 
perimental materials xjould profit more from them than students who 
had read only one. 

Note that the hypothesis makes no statement concerning the value 
of a specific combination of analyses - three are possible in this 
investigation (1. rondo and variations, 2. rondo and sonata, 3. vari- 
ations and sonata). Nor is evidence sought in order to make possible 
inferences regarding the way in which these analyses might interact. 

For example, one could hypothesize that the programs are simply additive 
in their impact. Another possible view is that studying a programed 
analysis of two different form.s is better than studying programs of 
two different compositions in the same design. The experimental de- 
sign in this study is structured so that only a simple answer to 
evaluate the effect of number of programs on the criteria can be made. 

A quick glance at Figures 1 and 2 reveals that the test is simply 
comparing subgroups of individuals v;ho have studied two with those 
who have studied one. 



*It is also interesting to note that in 5 of the 6 possible 
comparisons the relevant experiment subgroups obtained higher mean 
scores on the audio portion of the test than either the relevant 
control or general control subgroups. 



All the specific sub test-comparisons which v/ere significantly/ 
different in the hypothesized direction are listed in Table 7, 

Since there are six separate scores (five subtests plus total test) 
for each of the three classes and for each of the six comparisons 
which were possible to run to test hypothesis Ic, 108 specific 
sub test-comparisons were made. In only ten of these instances were 
the means of the relevant experimental subgroups significantly 
larger than the relevant control subgroups (see Table 7) . 



Table 7 

SPECIFIC SUBTEST-COi'IPARISOWS YIELDING SIGNIFICMT 
DIFFERENCES TO SUPPORT HYPOTHESIS Ic 



Class Comparison 



II 


9 


III 


10 


I 


11 


III 


11 


II 


12 


II 


12 


II 


12 


III 


13 


III 


14 


III 


14 



Sub test 

Knowledge of Form 

Rhythm 

Rhythm 

Rhythm 

Knowledge of Form 

Timbre 

Total Test 

Harmony & Tonality 

Timbre 

Total Test 



Composition 
Haydn's 101st 
Beethoven's Eighth 
Haydn's 100th 
Beethoven's Eighth 
Beethoven's Eighth 
Beethoven's Eighth 
Beethoven's Eighth 
Haydn's 100th 
Beethoven's Fifth 
Beethoven's Fifth 



The investigator feels that the acceptance or rejection of 
hypothesis Ic is impossible based on these data. Had the measuring 
tools been somex-zhat more reliable (or to put it another way, had it 
been possible to remove some of the restrictions imposed on the 
testing procedure), a more definite conclusion might have been 
reached. On the other hand, it is of interest to note that the audio 
sub tests listed in Table 7 are more numerous than the total tests or 
knowledge of form subtests. (It will be recalled that this was not 
the case in the data used to test hypothesis la.) Furthermore, 
nearly all the compositions upon which the achievement tests were 
constructed to test the validity of the hypothesis are Xi/orks which 



the students had not' studied beforehand. In other words, since im- 
provement in^^the audio subtests were noted more frequently this 
should encourage the educator to think that auto-instruction could 
prove to be a potent factor in the longitudinal development of 
listening skills. 

The means of the relevant experimentc.l subgroups exceeded the 
means of the relevant control subgroups in only 47 of the 126 com- 
parisons. However, the experimental subgroups’' means surpassed 
those of the general control subgroups in 100 of the 126 cases*. 
Hence, while the latter finding does give additional support to the 
view that programed analyses are generally superior to conventional 
materials and procedures in improving aural perception, the former 
proportion does not help to establish the validity of hypothesis Ic 
one way or the other. 



The Test of Hypothesis Id . It is of value, in the assessment 
of a teaching tool, to discover if the introduction of such a device 
at a particular time in a course of study has any bearing on its 
effectiveness. Of course, the experimental design here is so arranged 
that the test of the hypothesis is limited, to an evaluation of 
relative placement of the programed analyses in the school term. 

This is desirable, since one would obviously expect that the many 
variables operating in any specific course of study will interact 
uniquely with any kind of teaching procedure introduced, so that a 
general conclusion as to relative placement has the most value for 
educators . 



Within each of the three sections involved in this study, six 
comparisons between subgroups were possible to be made to gauge the 
effect of the ’’time placement'' variable. Note again (refer to 
Figures 1 and 2) that placement in the semester is the only main 
effect which differs between the two subgroups in each of the six 
comparisons. Tliat is, the effects of numbers of programs studied 
and the particular forms studied (i.e., rondo, sonata-allegro, 
variations) are equated for each subgroup in all six comparisons. 

Note that comparisons 15 and 16, however, are made using 
compositions which had been studied by both subgroups (albeit by 
the conventional or "control" procedures) while the remaining four 



*For the audio score alone, none of the comparisons was signi- 
ficant. Hovjever, the relevant experimental subgroup means v?ere 
greater than the means of the general control group means in 14 of 
the 18 possible comparisons on the basis of the audio score. 



ERIC 
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comparisons - 17 through 20 - have as the criteria the follow-up com- 
positions which neither subgroup had had the opportunity to audit be- 
fore the tests. One might expect that results among the comparisons 
could vary on the basis of this difference. 

Table 8 shows the six instances where significant differences be- 
tween the experimental and control subgroups v/ere found in the hypo- 
thesized direction for all six comparisons and all three classes. 
Again, recall that the total number of comparisons by sub tests for all 
classes was 126 (7 subtest scores x 6 comparisons x 3 classes). Thus, 
the number of significant differences is actually very small compared 
to the total possible. 

No pattern appears in the tabled comparisons which might lead one 
to draw inferences concerning the value of specific tine arrangements 
nor the kind of criteria used to make the test. Note, however, that 
real differences appear equally often for audio subtests as total test 
and knowledge of fontt scores, which provides some evidence (although 
scanty) that individuals can improve their aural skills by this means 
of self-directed study*. The evidence to support or refute the hypo- 
thesis that the programs are more valuable if introduced earlier in 
the term is, unfortunately, too tenuous to Justify a conclusion one 
way or the other. 

Table 9 gives the scores obtained by the subjects on the six 
achievement tests shown as grand means for all three classes combined. 
The purpose of this table is to provide a reference in the form of a 
summary of the raw data**. 



*It should be noted that while the number of times that the means 
of the relevant experimental subgroups exceeded those of the relevant 
control subgroups on the test of hypothesis Id in 67 of the 126 com- 
parisons, the experimental group means were higher than the general 
control subgroups a much larger proportion of the time: in 94 of the 

126 comparisons. Also, on the audio portion of the scores, the ex- 
perimental subgroups were better than the gesneral control groups (in 
the hypothesized direction) in 13 of the 18 comparisons. 

**For the means obtained in every subgroup of each class on all 
subtests, however, the reader should examine Tables 15 through 42 in 
Appendix A. The mean values presented in these 28 tables, of course, 
have been the basis for the tests of significance. 



SPECIFIC SUBTEST-C0I4PARIS0US YIELDING SIGNIFICANT 
DIFFERENCES TO SUPPORT HYPOTHESIS Id 



Class 


Comparison 


Sub test 


Composition 


III 


15 


Rliythm 


Haydn's 94th 


III 


16 


Knowledge of Form 


Haydn's 94th 


III 


17 


Total Test 


Haydn's 100th 


I 


19 


Harmony & Tonality 


Beethoven's Eighth 


III 


19 


Total Test 


Beethoven's Fifth 


II 


20 


Rhythm 


Haydn's 100th 



Of particular interest is a comparison of the magnitude of the 
values for each of the three groupings oi students. Note that tiie 
grand means for the general control subgroup are less than either the 
relevant experimental or relevant control subgroups in every case. 

It seems appropriate to repeat that the general control group never 
had the opportunity to study the experimental materials; therefore 
scores of these students were not used in the tests of the hypotheses 
even though these data do provide a useful reference. An examination 
of the schematic of Figure 1 quickly reveals that control over vari“ 
ables which was possible by comparisons bett^een the relevant experi- 
mental and control subgroups was not present in most of the comparisons 
between the experimental subgroups and the general control subgroups*. 



*The only situations in which the general control subgroup data 
(for example, those of Rg in Class I) could have been used properly 
to test a hypothesis were in comparisons 1 and 2. In the other cases, 
the variables described by the hypotheses could not be controlled, 
so that conclusions concerning the difference in treatment would not 
be valid. 



Table 9 



GRAND MEANS OF ALL OBSERVATIONS FOR TESTS OF EACH 
HYPOTHESIS COLLAPSING THE THREE CLASSES 



Grand Means 



Subtest 


Relevant Exp. 
Subgroup 


Relevant Control 
Subgroup 


General 

Control 






Hypothesis la 




Harm. & Ton. 


35.9 


34.7 


33.1 


K. of Form 


36.7 


32.6 


31.1 


Melody 


38.5 


36.6 


36.3 


Rhythm 


37.9 


37.1 


35.0 


Timbre 


36.7 


36.0 


34.6 


Total 


185.8 


178.4 


170.2 


Audio 


149.1 


145.9 


139.1 






Hypothesis lb 




Harm, & Ton. 


35.7 


35.0 


33.8 


K. of Fonri 


34.3 


32.9 


29.5 


Melody 


35.1 


34.4 


33.4 


Rhythm 


35.1 


35.9 


33.5 


Timbre 


36.5 


35.4 


34.0 


Total 


176.6 


173.6 


164.2 


Audio 


142.3 


140.7 


134.7 






Hypothesis Ic 




Harm. & Ton, 


35.2 


35.8 


33.7 


K. of Form 


32.9 


32.1 


29.5 


Melody 


35.3 


35.2 


33.8 


Rhythm 


35.5 


35.1 


33.5 


Timbre 


34.8 


35.5 


33.5 


Total 


173.7 


174.2 


164.1 


Audio 


140.8 


142.1 


134.5 






Hypothesis Id 




Harm. & Ton. 


36.0 


35.8 


34.1 


K. of Form 


32.8 


32.6 


29.9 


Melody 


35.6 


36.1 


34.5 


Rhythm 


35.9 


35.3 


34.1 


Timbre 


35.9 


35.5 


34.3 


Total 


176.3 


175.4 


166.4 


Audio 


143.9 


143.4 


136.9 







Comparisons with General Control Groups . It seems important at 
this point to call attention to two facts concerninj; the relationship 
of the experimental subgroups to the general control groups which 
should be kept in mir.d as one evaluates all comparisons made between 
them: 1) the general control groups and the experimental subgroups 

had the same initial ability in aural skills at the time the experi- 
ment began, and 2) the students in the general control groups never 
had access to the experimental materials. 

Hence, while comparisons betv/een the experimental subgroups and 
the general control groups cannot be used to determine the importance 
of the four factors articulated in hypotheses la, lb, Ic and Id such 
comparisons do give a measure of the effectiveness of studying the 
programed analyses. In other words, these comparisons reflect the 
total impact that the programed analyses have on students’ aural 
skills relating to audition of musical works but do not separate out 
the relative Importance of the influences spelled out in the hypotheses . 



Time Scores , A careful record was made of the amount of time - 
expressed in minutes - which each student spent listening to each of 
the three compositions. Student supervisors in charge of both the 
Record Listening Room and the Tape Listening Room collected time sheets 
from each student as he left the area. As has be -n noted in Giapter 2, 
this was possible since both listening rooms during the experiment were 
constantly under supervision, I'lhile no hypothesis was advanced con- 
cerning the relationship of time spent listening to each composition 
and the earned achievement score, the author felt it to be Important 
to gather such data in the event that a relationship could be estab- 
lished. 

Correlations were computed between the students* **time scores" 
(i.e., total time in minutes which a given subject spent in studyiir 
the composition) and achievement scores. A sample of experiraental 
and control subgroups provided data for this part of the study. The 
highest correlation obtained was -.14; the lowest coefficient was +.04. 

There was a considerable spread among students* time scores, as 
might be expected. Students in the groups assigned to the programs in 
most cases recorded a longer period of time studying than those who 
used the conventional materials*. It is of particular Interest to ob- 
serve that subjects assigned to the programed analyses recorded a 



*As an example, one **experimental" student studied the progratn 
built on Haydn’s 94th Symphony for nearly 20 hours. Another individual 
in the same block and class but in another experimental subgroup re- 
corded only two hours and fifteen minutes. 
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larger variation in tiiiie scores than those students who studiei- dislc 
recordings and the conventional reference books*. One vjould naturally 
anticipate that studying detailed programed analyses would, for college 
students, consume approximately similar amounts of time. On the other 
hand, when materials are provided which allots for the maximum leeway 
of usage, the expectation is that the amount of time spent studying 
these references would vary a great dea3. more than for programed 
materials. Speculation concerning the cause for this phenomenon will 
be made in the last chapter of the report. 

Of immediate concern to the investigator, of course, was whetaer 
or not time spent listening to the assigned works was a main source 
of variation in the students* achievement scores. Of particular 
interest was to determine whether or not observed differences in 
scores favoring the experimental subgroups could be attributed in 
large degree to the time these students spent at each assignment. 

This is especially true with respect to hypothesis la. 

The lack of correspondence as seen by the lov; correlations be- 
tween time and achievement scores x^as interpreted by the author to 
support the point of viev 7 that time per se did not contribute signi- 
ficant weight to test scores. In other words, it is doubtful, had it 
been possible to control for study time, that the results would have 
been modified to any significant degree. Admittedly, this conclusion 
must be considered to be tentative since there was no x^ay to deter- 
mine in this experiment the effect of time on scores. 

On the other hand, if it were possible to establish that an 
increase in study time by either method of study would inevitably 
lead to a significant increase in achievement, it would seem logical 
to seek x^ays and means to encourage students to listen longer. Thus, 
if the programs succeeded in acromplishing this state of affairs, 
certainly one would have to conclude that the programs are desirable 
from this point of viev; alone. 

Table 10 shox^s the means in minutes spent by students in the 
three subgroups for each of the three assignments. This table clearly 
reveals the decrease in numbers of minutes spent at listening as the 
semester proceeded. It would have been desirable, xjitli respect to 



^However, it was not uncommon among students listening to disk 
recordings in the general control groups to spend two to three hours 
of listening time for the assigned movement. Ore student in a control 
subgroup reported listening to the first movement of iiozart*s 40th 
Symphony a total of 5 1/2 hours. The investigator interviex/ed the 
student (ostensibly for another purpose) to verify his report and 
was subsequently satisfied that the student had indeed used the 
stated amount of time in serious study. 
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control subgroups. 



the goals of tlie study, had it been possible to require li\e perioas 
of time to be spent at each assignment, of course. No attempt was 
made by the investigator to urge students to spend specified amounts 
of time at their study since it was felt that such a procedure would 
affect the results of the study in an indeterminate \jay. It is 
quite likely, moreover, that the observed study patterns are not at 
all uncommon; results based on a controlled pattern of study tine 
therefore, might not have as wide a generalizability as one which is 
uncontrolled. 



The Analysis of Preferences 



It has been hypothesized (see hypothesis 2) that students v/ho 
have studied the listed compositions by means of the programed ana~ 
lyses would exhibit a different pattern of preferences for certain 
types of works as well as for particular compositions. Specifically, 
hypothesis 2 states that students in all the experimental subgroups 
will a) prefer absolute music more frequently than the control group 
and b) prefer those specific compositions they studied by the pro- 
gramed analyses more frequently than the compositions t^iiich they 
studied by conventional means. 

As explained in Chapter 2, tlie preference rankings (high = 1, 
low = 6) of the three programmatic compositions were obtained by 
tabulating the results of each of the fifteen possible paired com- 
parisons (6x5/2=15). In other v/ords, the frequency of choices of 
each pairing determined the ranks . 

Table 11 shows the preference ranking of three musical art forms 
for all three classes. Note that there is no difference among the 
classes; the rankings consistently show that the absolute compositions 
are ranked below the programmatic works although there is some vari- 
ation within the broad categories of absolute and programmatic. It 
is of considerable interest to note that Haydn's 94th Symphony, which 
has always been thought to be one of the perennial favorites of young 
people, occupies the lov7 rank in each class. (It was discovered 
that this composition was assigned the same rank regardless of hov/ 
the students were grouped.) 

Tables were constructed to reveal possible variations in these 
rankings not only by class but also on the two dependent variables s 
block and treatment. These tables are included in Appendix A; they 
clearly establish that these dimensions have little effect on the 
students* preferences, although some spotty minor fluctuations can 
be seen. 

A more precise method of determining the preferences was also 
made by assigning signed values to the choices in each comparison 



(fully explained in Chapter 2). Each preference for one of the 
absolute compositions in the pair was assigned a value of plus one 
(+1) and for the programmatic, a negative one (-1). Because six 
of the pairings involved like kinds of compositions, (either both 
programmatic or both absolute) the highest positive - or negative - 
score which could be obtained by this method was 9 • r_eans x^ere 
computed for the experimental subgroups and compared with the means 
for the general control group (the latter students had studied no 
p^Qgrams) • As can be seen in lable 12, there is very little 
difference among them; all are negative values, which indicate that 
the direction of their preferences is toward the programmatic music. 



Table 11 

PREFERENCE RANKINGS OF PROGIAi'HiATIC AND 
ABSOLUTE COIIPOSITIONS OF STUDENTS 
GROUPED BY CLASS 



Composition 




Rankinp; 




Class I 


Class II 


Class III 


Scheherazade^ 


1 


3 


1 


Romeo & Juliet^ 


3 


1 


3 


Die iloldau^ 


2 


2 


2 


ilozart’s ^Oth*^ 


5 


4 


4 


ilaydn’s 101s t^ 


4 


5 


5 


iiaydn's 9Ath^ 


6 


6 


6 




^prc gramma tic 


^absolute 





Because these data clearly demonstrated the nearly identical 
preferences of the groups and classes regardless of treatment, it 
was decided that significance tests were meaningless. In short, 
these data refute the hypothesis that the experimental treatment 
would modify preference of students for absolute music. 

In comparing those students who had access to the experimental 
materials for tx7o of the three absolute compositions V7ith those 
who had studied on’'/ one, the saiae kinds of scores emerge. The 
data shov/ing these comparisons are listed in Table 13. 



Table 12 



HEM SCORES OF EXPERIIIENTAL MD CONTROL GROUPS 
E}iPRESSIWG PREFERENCE FOR PROGRA^LiATIC 
VERSUS ABSOLUTE COilPOSITIONS^ 



Class 


General Control 
Group 


Experimental 

Subgroups 


I 


-5.31 


-4.75 


II 


-4.74 


-5.20 


III 


-5.48 


-4.78 


a 

negative 
pared to 


scores indicate preference 
absolute music. 


for program music com- 



Table 13 

MEAN SCORES SHOWING DIRECTION OF PREFLiOiNCES FOR 
ABSOLUTE VERSUS PROGRAILJATIC COMPOSITIONS 
OF EXPERHiENTAL STUDENTS GROUPED 
BY NUI4BER OF PIlOGRAjMS STUDIED^ 




Studied 


Studied 


Class 


One Program 


Two Programs 


I 


-4.7 


-4.8 


II 


-4.8 


-5.6 


III 


-5.0 


-4.6 



^negative scores indicate preference for program music com- 
pared to absolute music. 



The data in Table 13 reject the notion that more exposure to 
the analytical programs would effect a modification of preferences 
in the direction of the absolute music; there is no pattern to 
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substantiate any conclusion except that the treatment difference made 
no impact on preferences for this sample of students. 

The test of the second part of hy othesis 2 necessitated a 
separate procedure. This step had nothing to do with a student's 
preferences for absolute versus programmatic music but rather with 
his preferences for the compositlon(s) he had studied by the programed 
analyses. In order to quantify these preferences, when the student 
selected the composition he had studied by the programed analysis as 
bis choice, he was given a 4-1. VJhen a student selected a composition 
he had studied in the conventional way, a -1 was the score for that 
choice. Of course, this procedure of quantifying his preferences was 
followed only in the pairings in which at least one of the absolute 
composition (s) occurred. 

A small modification was necessary in scoring because all ex- 
perimental students either had studied one composition using tlie 
experimental materials and two with the conventional materials, or 
the opposite combination. Tlie scoring procedure used for those 
students who had studied two compositions using the experimental 
materials was equated with that used for students who had studied 
only one by doubling that part of the. score obtained from choices 
relating to the single composition. This was a step made necessary 
in order to cancel the spurious effect of larger negative or positive 
scores which might simply be a reflection of unequal numbers of pro- 
grams studied. In short, had all students been exposed to a like 
number of programs, this would not be a factor to consider. Tlie 
system of scoring described above yielded a maximum possible score 
(negative or positive) of 9 in both situations. 



Table 14 

tliSAil SCORES SHOEING DIRECTIOi.4 OF PREFERISKCES OF E.-TERIliEI-iTAL 
STUDEilTS FOR ABSOLUTE COid?OSITIOWS STUDIED WITH 
EXPERIIIEWTAL IIATERIALS VERSUS THOSE STUDIED 
WITH CX)HVEirnONAL MATERIALS^ 



Number of Compositions 
Studied with Experimental 




Class 




Haterlals 


I 


II 


III 


1 


-0.2 


-1.2 


-0.9 


2 


0,0 


-1.8 


-0.8 



Negative scores indicate preference for program music com- 
pared to absolute music. 



Table 14 reveals that the second part of hypothesis 2 likevjise 
is not supported by the data collected. The tabled values hover 
close to zero wliich can only be interpreted to mean the the relation- 
ship between the two methods (experimental and control) of studying 
the works had no or little effect on the experimental subgroups^ 
preferences. An examination of individual scores reveals some spotty 
variation but the lack of patterns discouraged the investigator from 
proceeding with further analyses. Wo relationships emerged between 
scores and such dependent variables as the specific treatment (see 
Figure 1), the compositions studied, nor the time of the seraester 
when the programs were presented. Detailed tabled values observed 
of the experimental subgroups of students grouped by treatments and 
blocks are Included in Appendix A (see Tables 49 and 50) . 

One must conclude from the data obtained by the preference 
rankings that choices for particular compositions by the subjects 
in this study vjere singularly unaffected by the experimental materials. 
Since it has been observed in the test of hypothesis la that the sub- 
jects who studied using the programed analyses performed significantly 
better than individuals who prepared themselves by using the con- 
ventional materials. Increased perceptions thereby achieved cannot 
be functionally related to preferences. The fact that this lack of 
correspondence was consistently demonstrated using tvjo separate 
criteria reinforces the conclusion. 



The Results of the Questionnaire 



It X'/lll be recalled that no hypothesis had been advanced prior 
to the beginning of the study concerning students' attitudes toward 
the experimental materials, l^en the experiment was in its late 
stages, it became apparent that casually observed variation in the 
subjects' affective responses indicated that a systematic evaluation 
would prove to be interesting and valuable. Of particular signifi- 
cance would be the students' point of view regarding the effectiveness 
of the programed analyses as teaching devices. Tlie procedure followed 
in collecting these data is outlined in the previous chapter; a copy 
of the questionnaire is included in Appendix B. Conclusions are 
based on a one hundred per cent response. 

l‘lost of the students apparently were serious about the oppor- 
tunity they had to express themselves about their experiences; the 
author Interpreted the high incidence of commentary on the forms as 
an indication of their desire to be as accurate as possible. 

The investigator followed up the questionnaires by interviews 
with a small sample of students to test the validity of the form; 
this check satisfied him that the questionnaire performed adequately 



as intended. Below is a summary of the findings based on the tabu- 
lated responses; response frequencies are included in thirteen tables 
in Appendix A (see Tables 51 to 63) . 

1. An argument sometimes made by educators against the use 
of tapes and tape decks is that such facilities are 
difficult to use in comparison to disk recordings and 
turntables. Tlie data in Table 51 (Appendix A) do not 
support this point of view. Although one-fourth of 
the students did feel that using records was less 
difficult, an equal-sized proportion felt it was easier 
to use tapes and tape machinery. The largest number of 
subjects - 50 per cent - agreed that the tape adjunct 
was no more difficult to manipulate than the conventional 
record-and-turntable facility. 

2. Table 52 in Appendix A shows that a large portion - 87 
per cent - oi the 179 students who were exposed to the 
experimental materials felt that the prograras were help- 
ful to them in learning to listen to the compos i tion (s ) . 
Furthermore, only three per cent felt they could have 
learned more had they used conventional recordings. 

The author thought it to be unnecessary to make an attempt 
at scaling affirmative replies to the question (see 
question 2 of the questionnaire in Appendix B) ; precision 
did not seem to be important here in light of the fact 
that scores on the achievement tests are the main cri- 
teria. Tlie investigator interpreted the heavy affirma- 
tive response to mean that the students had confidence 
in the programs as teaching devices in learning to hear 
specific musical sounds. 

3. A common problem with beginners is that they fail to 
comprehend the significance of verbal symbols used in 
music. Many expressive terms and the contexts in which 
they are used to convey meaning are borrowed from other 
disciplines or analogous situations. Without an ade- 
quate background of associations and experiences upon 
which to draw, verbal expressions as they relate to 
the musical sounds are often devoid of meaning for the 
individual. The students in this study apparently felt 
they were helped significantly by the programed analyses, 
for ninety per cent selected the affirmative response 

to the query concerning the teaching effectiveness of 
the materials in this regard (see Table 53 in Appendix 
A) . Even though the main objective of the programs is 
to develop aural perception, an understanding of verbal 
"pointers” used to describe the sounds apparently is 
also a valuable experience to the learners. 
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4. The investigator is aware that nany students prepare 
listening assigntiienta in a seewingly casual manner even 
though they know they are to be tested. (The reason 
for this behavior Is generally thought to be that be- 
ginning students of music simply do not know hoxiy to 
study a composition.) Therefore, he was not unpre- 
pared for coiraaents of dismay at the amount of time 
they found it necessary to spend in reading the pro- 
grams • The crucial point , however , is : vjsls the time 
spent worth it in terms of results? The data in 
Table 54 (Appendix A) show that 48 per cent thought it 
probably i^as and another 27 per cent were sure of it. 
Therefore, three-fourths of the group agreed that 
benefit gained from the materials merited the time 
spent obtaining it. 

5. It has been pointed out in the preceding chapter that 
most of the frames alluding to the actual sounds of the 
composition Include line score notation of the specific 
thread of sound being discussed. The purpose of the 
tape adjunct, of course, is to reduce the necessity for 
music symbols in malting references to the structure of 
the work, and instead, to place primary emphasis on 
audio cues for understanding. Nonetheless, as seen in 
Table 55 (Appendix A) , over eighty per cent of the ex- 
perimental subjects replied that the music notation 
references were helpful, and, of that number, nearly 
half felt that without the notation they would not have 
learned as much as they did. 

6. The author feels that one's ability to perceive the 
patterns of music visually in the orchestra score is 
useful (but not essential) to one's understanding of 
the music's form. To this end, the programs are con- 
structed to make frequent and easy references to the 
orchestra score. Not quite two-thirds of the experi- 
mental students stated that the programed materials 
helped them learn to follow the orchestra score of the 
composition they studied. Approximately half of these 
felt sure that they profited in this maimer. However, 
a like number responded that the materials had no 
effect on their ability to follow the notations in the 
orchestra score. The relatively high frequency of 
"favorable" response to this question was not expected, 
however. It would have been interesting to discover 
v^ether or not this response reflects a real increase 
in their ability to follow the score (see Table 56, 
Appendix A). 
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7. In many Instances, it was deemed necessary to separate 
specific strands of the sound’s mass in order to call the 
students' attention to the contribution of a particular 
sound in the total configuration. In textbooks, these 
references are usually made by notation only. In the 
programs, these vjare performed on the pianos the con- 
cern here x/as the effectiveness of this medium of pre- 
sentation instead of the orchestral instrument (s) to 
which the composer originally assigned the music. Also, 
another aspect of this procedure is the effectiveness of 
the ’’separation-of-sounds*' technique Itself in learning’ 
to hear the music and its design. In response to this 
question, 90 per cent of the individuals said thay were 
helpful or helpful some of the time. Table 37 (Appendix 
A) shows that only ten per cent of the students felt this 
method was not effective. 

4 

3. The author argues that one of the strengths of th.i pro- 
gramed instruction method Introduced in this Investigation 
is that it is easy for the student to adapt the program to 
fit his ovm particular background of abilities. lie may 
repeat the frames in the text or the taped references as 
often as he feels it to be necessary to get the full inpac: 
of the material. He can lightly skim parts of the program 
which are readily assimilated and/or he can bear doxm 
harder at those places which are more elusive. The latter 
is particularly true with respect to the musical references 
Finally, certain sections of the tape adjunct are parti- 
cularly good for reviex7 purposes (especially the "synthesis 
portion) . Hearly ninety per cent (see Table 58 in Appendix 
A) of the students were aware of this feature of the 
Toaterials and a majority of these said they made use of 
the programs in that way. 

9. The students were solicited concerning their opinion of 
the value of using the programs and adjunct tapes 
regularly in the course they had just completed. Tlie 
primary purpose of this ques ;ion was, of course, a means 
of ascertaining the students' attitude tovjard the general 
transfer value of the programs to other learning ex- 
periences and their significance as a part of the course 
of study. Table 59 (Appendix A) shows that 43 per cent 
strongly recommended that programs such as these be intro- 
duced as a part of Mu 112 experiences? only tv;o per cent 
responded negatively, while the majority (55 per cent) 
felt that some students would find them profitable. Res- 
ponses to question 11, which pertain directly to transfer 
effect of these learning experiences, show quite close 
agreeoient x^ith these totals . 
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10. The students* attitudes toward the structure of the pro- 
grams with specific regard to the size of the steps 
(frames) were interesting. Table 60 (Appendix A) reveals 
that only one student was of the opinion that the pro- 
grams proceeded too rapidly. I'Jhile ten per cent thought 
they moved too slowly, S3 per cent felt they were ’’just 
right” or suitable most of the time. Of the eighteen 
students who replied that the programs moved too slowly, 
twelve were in the upper half of the EIIAT score distri- 
bution; these data probably signify that their reaction 
is more a function of their general musical ability than 
a fault of the programs. (Tlie other six students simply 
may have resented the amount of time necessary for them 
to complete their listening assignment.) The fact that 
more than eight of every ten students reacted favorably 
to the graduation of steps is interpreted by the author 
to indicate that the students felt they had no difficulty 
in assimilating the subject matter content. 

11. As mentioned in point 9 above, a majority of students 
felt that the programed analyses should become a part 
of the regular course of study in Mu 112. Question 11 
asks the students for a direct evaluation cf the programs' 
utility in aiding them in other learning situations in 
the course. Thus, question 9 is a general appraisal of 
the materials in the study of music literature; now the 
students are being requested to be more specific about 
the practicality of these devices judged by their transfer 
value. Table 61 (Appendix A) shoi^s that exactly twice 

as many individuals felt there vras a noticeable transfer 
effect to other situations in Mu 112 as those who felt 
the transfer v;as negligible. An examination of the res- 
ponses of the students xfhen the individuals are grouped 
by blocks (see below) revealed that there was no corres- 
pondence between musical ability as measured by the EIIAT 
and the students' estimate of the programs' transfer value. 





Blocks 1 

1-5 


Blocks 

6-10 


Totals 


Noted 

transfer 


51 




100 


Did not 
note 


24 


26 


50 


transfer 








Totals 


75 


75 


150* 



*29 students chose 
another response 
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One V 70 uld naturally have anticipated that students with 
a weaker background of skills in music would have felt 
more strongly that there was transfer value than was 
demonstrated. 

12. T'Then students were asked to check which aspects of the 
printed part of the programs they felt were features of 
the texts, the largest number of answers x^ere in res- 
ponse to the phrases "covered important material" and 
"clear and concise". Table 62 (Appendix A) shows the 
percentage of response to those itenis were 76 and 74 per 
cent respectively. Hote that only 30 per cent of the 
students checked "interesting" as a feature*. On the 
other hand. Table 63 (Appendix A) shox/s a much more 
regular and frequent response pattern to listed features 
of the tape adjuncts Three-fourths or more of the 
students responded to almost every listed "desirable" 
characteristic listed on the form descriptive of the 
tapes . 

Responses of the students also were grouped so that it v;as 
possible to compare students in various combinations of treatment 
subgroups as well as by class and block. No unique patterns of re- 
sponse eiQvjrged from these cross -tabulations. In other x^rords, it 
is apparent that the students’ feelings about the effectiveness of 
the programs could not be traced to their unique background of 
abilities prior to registration in the course, to the effect of a 
particular instructor, to the specific program (s) studied, nor to 
the number of programs read. In short, the variations found in the 
response pattern of which students had formed concerning the values 
of this instructional tool (other than the main effect of the ex- 
perimental treatment) were generated by unknoxi/n variables, possibly 
such as earlier happy or unhappy contacts with programed instruction 
biases about the value of music instruction in general, and so forth 
It is probably safe to say that other samples of individuals X7ith 
similar training taken from other populations x^ould react in much 
the same way. 

Exactly half of the students either took time to elaborate on 
their reactions or accepted the invitation at the end of the ! 

questionnaire to make general comments. Most of the commentary 



^Unfortunately, it is not possible to determine xahether they 
considered this to be a fault of the text or the composition upon 
vThich it is based. Judging from the students* strong preference 
for programmatic music, one probably should conclude that preference 
for the latter is the main contributor. 
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was favorable and supported the continued use of the experinental 
materials; some students were noncommittal and pointed out strengths 
or weaknesses in an objective way; a very small proportion of the 
individuals were critical and expressed a general antipathy con- 
cerning the use of the programs. 

Of the students who were critical, the major number commented to 
the effect that the programs were time-consuming. One student stated 
that he felt the material "seemed repetitious and boring^ . The stu- 
dents who gave objective replies about the disadvantages of the 
materials remarked about flaws such as "the noise in the classroom 
next door" or "there should be more stations". 

Roughly 75 per cent of those students who returned comments on 
the form made favorable responses . Most of these were expressions 
concerning the value of the "tapes" in aiding them to hear the music 
and to understand more of "vjrhat is going on in the music". Several 
comments such as the following were received: "I really feel that 
learning by tapes is advantageous and of great value in understanding 
f ully the music. Definitely continue using them in Music 112". An 
interesting point v7as brought up by a small number of students to the 
effect that they felt the achievement tests didn’t measure the amount 
of learning which they had made as a result of studying the programs. 



Summary 



'.rhis chapter presents In detail the processes by v/hich the data 
collected in the investigation to test the hy'/othesvas were analyzed. 
The results of these tests have been presented and interpreted and 
Inferences drawn relating to the stated hypotheses. It has been 
shown that one of the major problems vjhich developed in this study 
was that the measuring Instruments constructed to test the four 
hypotheses grouped under 1 (la, lb, Ic, and Id) \ 7 ere found to have 
unusually low reliability indices, particularly the audio subtests. 
(The total test and the knowledge of form sub test had the highest 
indices.) Thus, conclusions based on the significance tests, 
particularly with reference to hypotheses lb, Ic, and Id, must re- 
main tentative. The following observations, based on the data were 
of central Importance as they relate to the objectives of the ex- 
periment : 



1. Data obtained by the EilAT pretest showed that the three 
classes were very much alike on this criterion. From 
an examination of these data, it was clear that the 
variations within each of the classes based on aural 
acuity as measured by the ELiAT were similar. 
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The frequency of significant differences found between the 
relevant experimental and control subgroups on the criteria 
is interpreted to Indicate the superiority of the programed 
analyses used for preparing out-of-class listening assign- 
ments compared to the conventional equipment and facilities. 
These x<rere found in the students^ aural perception of 
harmony and tonality and rhythm, but not so frequently with 
respect to timbre, and never to melody. Knov? ledge of form 
subtest scores and total test scores revealed more frequent 
differences between relevant experimental and control sub- 
groups than any one of the other subtests. 



3. The transfer effect of a programed analysis of a composition 
in a particular form (i.e., sonata, rondo, etc.) to a com- 
position not programed but in the same form as the one 
studied could not be established. To put it another way, 
the power of a programed analysis did not carry over for 
the learner into a different composition even though it was 
in the same form based on the criterion used. Unfortunately, 
as mentioned above, the sensitivity of the criteria was re- 
duced by their low internal consistencies. With more reli- 
able measures, it may have been possible to note the trans- 
fer effect posited. 



4. Wo decision was reached concerning the value of studying two 
programs compared to the effect of one such exposure on 
transfer to other works. Although on ten occasions, signi- 
ficant differences were found betiireen the relevant experi- 
mental and control subgroups, the lov/ ratio of these 
differences to comparisons tested made it difficult to 
arrive at a firm conclusion. It was heartening to note, 
however, that the larger proportion of the differences 
occurred beti/een means based on the audio subtests, strength- 
ening the proposition that aural perception within the con- 
text of a musical art form could be developed by auto- 
instruction. 



5. The findings do not support conclusively that relative 
early or late placement of the programed analyses in the 
school term make a substantial difference on subsequent per- 
formance with regard to works not audited prior to testing. 
However, as pointed out^ above, significant differences did 
appear again in the audio sub tests, v/hich fact encourages 
one to believe that skills in aural perception can be 
developed by auto-instruction and transferred to other com- 
positions not previously audited. 

6. Comparisons made between the experimental subgroups and the 
general control subgroups, while admittedly they could not 




- 64 - 










be used to test tl.e Hypotheses directly, clearly revealed 
that tlie use of the programs caused a systematic difference 
to emerge among these groups c Using this evidence, one ^ 
cannot determine which element or elements of the use of tae 
prograuied technique resulted in these differences. None- 
theless, the high proportion of differences among the means 
in the hypothesized direction shows that the use of the 
prograiTis do make an impact on students* aural perception. 

Preferences for particular compositions were not affected by 
the programed analyses. Observations based on other groupings 
(treatment, block, class) resulted in similar rankings: con- 
sistently the analysis revealed that the programmatic works 
were preferred over the absolute forms. Apparently, the 
amount of knowledge an individual had accumulated about the 
form and structure of compositions made no difference in his 
preference rankings. Thus, students who had studied a com- 
position by the programed analysis did not consistently shovj 
any different pattern of preferences than those in the sample 
groups who studied the same works conventionally. Persons 
with high scores on the EIFAT pre-test (both experimental and 
control sections) demonstrated the same preference patterns 
as those with low scores. 

A substantial majority of the students who had studied the 
experimental materials felt that the programed analyses were 
effective means of learning how to listen to a liiusical com- 
position. The majority stated that the experimental materials 
were more valuable to them than the conventional facilities 
and procedures used in preparing outside listening assignments. 





CHAPTER IV 



SUIUiARt AND RECOi-ttlLNDATIONS 



The central purpose of this section will be to consider the 
results of the experiment in an attempt to formulate generalizations 
regarding the instructional value of the use of linearly programed 
analyses of musical works vjith taped musical references. Another ob- 
jective in this part of the report is to make a translation of the 
observations into the practical significance of these tools as they 
flight contribute to the attainment of educational goals. Suggestions 
will also be made concerning the value of and need for further in- 
vestigation and experimentation relating directly and indirectly to 
the problem of teaching individuals to listen intelligently to 
musical art forms. 



Review of Rationale 



It seems to be important that instructional tools be aeveloped 
to reach individuals V'jithin a wide age range v/ho possess varying 
degrees of musical ability and aptitude and who do not (or cannot) 
expect to make an Intensive study of music. The general musical 
education of this large target population, in contrast to the rela- 
tive few who look forward to a serious study of music*, is a matter 
of increasing concern to music educators. Because of the rising en- 
rollments in our public schools and colleges, introductory classes 
in music literature (often called "music appreciation" courses) are 
necessarily being increased in size to accommodate the gro^i)^ing 
student load. Obviously, ways must be sought to adjust instructional 
modes so that an increase in student numbers does not necessarily 
have to mean a corresponding reduction in teaching effectiveness. 

An important value of the programed instructional materials is 
that they are self-teaching devices. The introduction of this tech- 
nique into the course of study frees the instructor to develop in his 
large classes subtle and/or detailed aspects of his course which may 
not be so economically treated by means of programed devices , but 
can be the subject of lectures to large groups of students. 



*Those who major in music norraaliy meet in comparatively small 
classes and/or are tutored individually, especially so in applied 
music. 



Tape transports and lieadsets are "ancrally not expensive T\or 
technically oophlsticated instructional nodules. Ordirarily tape 
transports can be Installed as an integral part of the play -back 
apparatus of any amplification unit by connecting it in parallel with 
the record turntable*. 

It was expected (and demonstrated) that students could be led to 
achieve a deeper understanding of the form of a particular design when 
they are guided by a program than when assigned to conventional materials 
and aids. It was also expected that individuals would develop a 
keener auditory awareness of some of the components which shape the 
form of the composition as well. The latter, of course, is of 
particular Interest to the music educator since it is a student *s 
aural responsiveness which is critical in his understanding of music 
literature. In short, the individual must become perceptive of the 
variations within the sounds themselves as the primary source for his 
comprehension and appreciation. To provide a means of self-instruction 
for the musically naive student so that he himself is in a position to 
refine his aural comprehension relating to a specific musical work 
seems to be a significant addition to educational procedure. 



Features of the Experimental Materials 



The author lists the following significant features of the ex- 
perimental materials which seem to be of particular value in helping 
to develop an individual’s skill in listening to music; 

1. The student has maximum control over the use of the 
programed materials to compensate or adjust for his 
own capabilities and interests. This is especially 
true with respect to the use of the taped music ref- 
erences used in conjunction with the score (this 
flexibility is not present to such a marked degree 
in the printed text of the programs , of course) . 

Since each of the taped references deals with one idea, 
it is a simple and effective procedure to re-wind the 
tape to the desired section for as many replays as 
necessary until the ear grasps the outline of the sound. 



*Iia 2 cimum benefits from these or similar programed materials are 
realized x/hen students work individually. In some cases, however, when 
students can xTOrk together at approximately the same rate of speed 
and have a similar background of aural skills, it is possible that 
a station could accommodate more than a single individual at the same 
time. 



